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Preface 


This book aims to provide a broad introduction to quantification issues of 
risk management. The main function of the book is to present concepts 
and techniques in the assessment of risk and the forms that the aggregate 
risk may be distributed between business units. The book is the result of 
our research projects and professional collaborations with the financial and 
insurance sectors over last years. The textbook is intended to give a set of 
technical tools to assist industry practitioners to take decisions in their pro- 
fessional environments. We assume that the reader is familiar with financial 
and actuarial mathematics and statistics at graduate level. 

This book is structured in two parts to facilitate reading: (I) Risk assessment, 
and (II) Capital allocation problems. Part (I) is dedicated to investigate risk 
measures and the implicit risk attitude in the choice ofa particular risk mea- 
sure, from a quantitative point of view. Part (II) is devoted to provide an 
overview on capital allocation problems and to highlight quantitative meth- 
ods and techniques to deal with these problems. Illustrative examples of 
quantitative analysis are developed in the programming language R. Exam- 
ples are devised to reflect some real problems that practitioners must fre- 
quently face in the financial or the insurance sectors. A collection of com- 
plementary material to the book is available in http: //www. ub. edu/rfa/R/ 


Part (I) covers from Chapters 1 to 5. With respect to risk measures, it 
seemed adequate to deepen in the advantages and pitfalls of most com- 
monly used risk measures in the actuarial and financial sectors, because the 
discussion could result attractive both to practitioners and supervisor au- 
thorities. This perspective allows to list some of the additional proposals 
that can be found in the academic literature and, even, to devise a family of 
alternatives called GlueVaR. Chapters in this part are structured as follows: 
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Chapter 1 - Preliminary concepts on quantitative risk measurement 


This chapter contains some preliminary comments, notations and defini- 
tions related to quantitative risk assessment to keep the book as self-contain- 
ed as possible. 


Chapter 2 - Data on losses for risk evaluation 


A descriptive statistical analysis of the dataset used to illustrate risk mea- 
surement and allocation in each chapter of the book is here presented. 


Chapter 3 - A family of distortion risk measures 


A new family of risk measures, called GlueVaR, is defined within the class of 
distortion risk measures. The relationship between GlueVaR, Value-at-Risk 
(VaR) and Tail Value-at-Risk (TVaR) is explained. The property of subad- 
ditivity is investigated for GlueVaR risk measures, and the concavity in an 
interval of their associated distortion functions is analyzed. 


Chapter 4 - GlueVaR and other new risk measures 


This chapter is devoted to the estimation of GlueVaR risk values. Analytical 
closed-form expressions of GlueVaR risk measures are shown for the most 
frequently used distribution functions in financial and insurance applica- 
tions, as well as Comish-Fisher approximations for general skewed distribu- 
tion functions. In addition, relationships between GlueVaR, Tail Distortion 
risk measures and RVaR risk measures are shown to close this chapter. 


Chapter 5 - Risk measure choice 


Understanding the risk attitude that is implicit in a risk assessment is crucial 
for decision makers. This chapter is intended to characterize the underlying 
risk attitude involved in the choice of a risk measure, when it belongs to the 
family of distortion risk measures. The concepts aggregate risk attitude and 
local risk attitude are defined and, once in hand, used to discuss the ratio- 
nale behind choosing one risk measure or another among a set of different 
available GlueVaR risk measures in a particular example. 


Part (II) covers from Chapters 6 to 8. Capital allocation problems fall on 
the disaggregation side of risk management. These problems are associated 
to a wide variety of periodical management tasks inside the entities. In an 
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insurance firm, for instance, risk capital allocation by business lines is a fun- 
damental element for decision making from a risk management point of 
view. A sound implementation of capital allocation techniques may help 
insurance companies to improve their underwriting risk and to adjust the 
pricing of their policies, so to increase the value of the firm. Chapters in this 
part are structured as follows: 


Chapter 6 - An overview on capital allocation problems 


There is a strong relationship between risk measures and capital allocation 
problems. Briefly speaking, most solutions to a capital allocation problem 
are determined by selecting one allocation criterion and choosing a particu- 
lar risk measure. This chapter is intended to detect additional key elements 
involved ina solution to a capital allocation problem, in order to obtain a de- 
tailed initial picture on risk capital allocation proposals that can be found 
in the academic literature. 

Personal notations and points of view are stated here and used from this 
point forward. Additionally, some particular solutions of interest are com- 
mented, trying to highlight both advantages and drawbacks of each one of 
them. 


Chapter 7 - Capital allocation based on GlueVaR 


This chapter is devoted to show how GlueVaR risk measures can be used to 
solve problems of proportional capital allocation through an example. Two 
proportional capital allocation principles based on GlueVaR risk measures 
are defined and an example is presented, in which allocation solutions with 
particular GlueVaR risk measures are discussed and compared with the so- 
lutions obtained when using the rest of alternatives. 


Chapter 8 - Capital allocation principles as compositional data 


In the last chapter, some connections between capital allocation problems 
and aggregation functions are emphasized. The approach is based on func- 
tions and operations defined in the standard simplex which, to the best of 
our knowledge, remained an unexplored approach. 
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PART | 


RISK ASSESSMENT 


1 Preliminary concepts on 
quantitative risk measurement 


This chapter is structured in two parts. The first one is intended to com- 
pile a set of theoretical definitions that we consider useful and relevant for 
quantitative risk managers. These definitions are related to the quantitative 
risk assessment framework of unidimensional risk factors, so other key is- 
sues like multivariate dependence are not covered herein. In our opinion, 
the concepts addressed in this chapter are the building blocks of unidimen- 
sional risk measurement which need to be helpful to practitioners. A care- 
ful first reading of this part is not necessary if one is already familiar with 
the fundamental ideas, because our aim is to leave it as a reference point 
to which to go back whenever needed. The second part serves to introduce 
ideas to bear in mind when moving from theory to practice. As before, this 
selection is subjective and it relies on our judgment, and the reader could 
consider the subjects in this selection too specific or too obvious. This is 
also the reason why we close the chapter with some brief remarks, in which 
we provide additional topics to be aware of and selected references in the 
literature to become an expert on risk quantification. 


11 Risk measurement - Theory 


11.1 First definitions 


Definition 1.1 (Probability space). A probability space is defined by three 
elements (Q, £, P). The sample space Q is a set of all possible events of a 
random experiment, & is a family of the set of all subsets of Q (denoted 
as &@ € —Y(Q)) with a o-algebra structure, and the probability P is a map- 
ping from to [0,1] such that P(Q) = 1, P(@) = 0 and P satisfies the 
o-additivity property. 
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Some remarks regarding the previous definition. 2 has a o-algebra struc- 
ture if Q € of, if A € æ implies that Q~ A = A® € A and ifUn>1 An € & for 
any numerable set {An}n>1. Additionally, the o-additivity property afore- 
mentioned states that if {An}n>1 is a succession of pairwise disjoint sets be- 
longing to of then 

+00 +00 

e[Û An) =} P(An). 

n=1 n=1 
A probability space is finite, ie. Q = {®1,@2,...,@n}, if the sample space 
is finite. Then (Q) is the o-algebra, which is denoted as 2°) In the rest 
of this book, N instead of Q and m instead of © are used when referring 
to finite probability spaces. Hence, the notation is (N a ,P), where N = 
{M,M2,..., Mp}. 


Definition 1.2 (Random variable). Let (Q, æ, P) be a probability space. A 
random variable X is a mapping from Q to R such that X~!((—oo, x]) := 
{(OEQO:X(O) <xseP,VXER. 


A random variable X is discrete if X (Q) is a finite set or a numerable set 
without cumulative points. 


Definition 1.3 (Distribution function of a random variable). Let X be a ran- 
dom variable. The distribution function of X, denoted by Fx, is defined by 
Fx (x) := P(X7!((-oo, x])). The notation P(X < x) = P(X! ((-co, x])) 
is commonly used, so expression Fx (x) = P(X < x) is habitual. The distri- 
bution function of a random variable is also known as the cumulative dis- 
tribution function (cdf) of that random variable. 


The distribution function Fx is non-decreasing, right-continuous and satis- 
fiesthat lim Fy(x)=Oand lim Fy(x)=1. 
xXx——oo xX—+00 


Definition 1.4 (Survival function of a random variable). Let X be a ran- 
dom variable. The survival function of X, denoted by Sx, is defined by 
Sx (x) := P(X7!((x, +00))). The following notation is commonly used, P(X 
> x) = P(X7!((x,+00))), so expression Sx(x) = P(X > x) is habitual. So, 
the survival function Sx can be expressed as Sx (x) = 1— Fx (x), forall x € R. 


The survival function Sx is non-increasing, left-continuous and satisfies that 
im. Sx(x) = land Jim. Sx(x) = 0. Note that the domain of the distri- 
bution function and the survival function is R even if X is a discrete ran- 
dom variable. In other words, Fx and Sx are defined for X (Q) = {x1, X2,..., 
Xn,...} but also for any x € R. 
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Definition 1.5 (Density function). A function f defined from R to R is a 
density function if f > 0, if it is Riemann integrable in R and if the follow- 
ing equality holds: 


+00 
f fOdt=1. 


A random variable X is absolutely continuous with density fx if its distri- 
bution function Fy can be written as Fx (x) = es fx(@dt for all x € R. 
Let us remark that, in such a case, the derivative function of Fx is fx, so 
dFx(x) = fx(x). 

If X is a discrete random variable such that X (Q) = {x1, X2,...,Xy,...} then 
forif x € {X1,X2,...,Xn,..-}, the density function may be defined by fx (x) = 
P(X = xi) and fx(x) = 0 if x € {x1, X2,...,Xn)..-}. 

Apart from discrete and absolutely continuous random variables there are 
random variables that are not discrete neither absolutely continuous but 
belong to a more general class. These random variables are such that their 
distribution function satisfies that 


Fy (x) = (1- p): FE (x) + p- Fix) (1.1) 


for a certain p € (0, 1), and where F is a distribution function linked to an 
absolutely continuous random variable and Ee is a distribution function as- 
sociated to a discrete random variable X@ with X@(Q) = E A XG pois r sachs 


Definition 1.6 (Mathematical expectation). Three different cases are con- 
sidered in this definition. 


Discrete case 


Let X be a discrete random variable with X (Q) = {x], X2,...,Xn,...}. X has 
finite expectation if LS |x;|-P(X = x;) < +00. Ifthis condition is satisfied 
then the mathematical expectation of X is E(X) € R, where E(X) is defined 
by 


+00 +00 
E(X) = } xi P(X =x) = Do xi fx (x). 
i=1 i=1 


Absolutely continuous case 


Let X be an absolutely continuous random variable with density function 
fx. X has finite expectation if S |x|- fx(x)dx < +00. If this condition is 
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Table 1.1 Examples of random variables 


Type of r.v. Name ofrv. Distribution function 
Binomial M) k m-k 
Discrete l Fx(x)= | | -4-A -q) 
X ~ B(m,q) È k 
Absolutely Normal, F x(x) q i 
continuous X ~ N(u,07) -ex l- t— 2} at 
j -œ OV 20 7 20° aH 
Fx(x) = 
Mixed Mixed 0 ifx<0O 


exponential 


1—(1- p)-exp{-A- x}ifx 20 


The probability of {X = 0} is equal to p € (0, 1), the 
probability of {X < 0} is zero and strictly positive values 
have assigned a probability of and exponential rv. of 
parameter A > 0, additionally multiplied by 1 — p. 


satisfied then the mathematical expectation of X is E(X) € R, where E(X) 
is defined by 


+00 
E(X) -f |x|- fx(x)dx < +00. 


CO 


General case 


Let X be arandom variable with distribution function of the form (1.1), and 
such that 


p-Fi(xy= > F(a) = tim Fx(0)] = » P(X =x), 


XjSX XjSX 


d- p): F(x% = Fx(x) - p- Fa (x) -f fdt, 


where {x1, X2,...,Xn,...} is the set of discontinuity points of Fy. In this case, 
ifthe random variables linked to F a and F$ respectively have finite expec- 
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Figure 1.1 Graphs of distribution and survival functions of random variables from 
Table 1.1, with parameters m = 100, q = 5%, u = 0,0 = 1, p = 70% and A = 0.02. 


Binomial Distribution Function Binomial Survival Function 


00 04 08 
00 04 08 


0 5 10 15 0 5 10 15 


m=100 , q=0.05 m=100 , q=0.05 


Normal Distribution Function Normal Survival Function 


oo 04 08 
00 04 08 


mu=0 mu=0 , 
, Sigma=1 sigma=1 


Mixed Exponential Distribution Function Mixed Exponential Survival Function 


00 04 08 
00 04 08 


p=0.7, lambda=0.02 p=0.7, lambda=0.02 


tation then 


+00 +00 

E(X) = }_ xi- P(X = xi) -f x- f(x)dx. 
i=l -00 

Note that the differential function of a distribution function Fy, which will 

be denoted d Fx and is usually known as probability density function (pdf), 

may be defined by 


P(X = xi) if xE€{x1,X2,..., Xn -p 
dF,(x) = (1.2) 

fy (x) if x € {X1,Xo,..., Xn)... 
Taking advantage of this notation, ifthe random variables involved have fi- 
nite expectation then the mathematical expectation in the discrete, the ab- 
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solutely continuous or the general cases can always be written as 


+00 
EW) = f x-dF x(x). 


CO 


This expression unifies the ones used in Definition 1.6 and makes further 
reading easier than more complicated notation. 

The following result will be really helpful in several parts of this book, al- 
though comments on its usefulness cannot be provided at this stage. The 
result shows how to interpret the mathematical expectation of a random 
variable in terms of its survival function. 


Proposition 1.1. Let X be a random variable with finite expectation. The fol- 
lowing equality holds: 


0 +00 
EX) = | (Sx()- Dats | Sx(t)dt. (1.3) 
co 0 


Proof. Each summand in (1.3) is treated separately, despite the idea behind 
the proof is basically the same. First of all, consider 


0 +00 
a= f (Sx(t)—1)dt and b= | Sx(f)dt. 
tore) 0 


With this notation, E(X) = a+ b has to be proved. In order to prove that, let 
us recall that E(X) = [tS x- dF x(x) and rewrite this last expression as 


0 +00 
E00 = | xd F(x) + | x: dFx(x)=a'+b'. 
-00 0 


Using Fubini’s theorem in (+): 


+00 +00 x 
b= [ xd Fx(x)= | [f at] dF x(x) 
0 t= 


(=) +00 +00 +00 
ai [f ary(o)ar= f (Fx (+00) — Fx (t))dt 
t=0 x 


=t 


+00 +00 

-f a-Fxdr= | Sx(t)dt 
t=0 0 

=b. 
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0 0 x 
d=f x-dF x(x) = | [f ar) dFx(x) 
—00 x=—00 UJ t=0 
w f? t 0 t 
2 f | f (dF x(x) dt = f | f (asx(xo)} ar 


0 
-f (Sx (0) — Sx(-0o))dt 
t 


rae 
-f (Sx(t)-1)dt 
t=—00 


=a. T 


The proposition has been proved, using that Fy(+00) = im. Fx (x) = 1, 

Sx(-oo) = lim Sy(x) = lim (1—- Fy(x)) =1- lim Fy(x) = 1 and 
X——oCo xXx——oo xXx——oo 

dS x(x) = d(1— Fx(x)] = —dFx (x). 


Definition 1.7 (Risk measure). Let T be the set of all random variables de- 
fined for a given probability space (QO, .2/, P). A risk measure is a mapping p 
from T to R, so p(X) is a real value for each X €T. 


Frequently, the set T is considered to be the set of p-measurable functions 
defined on the probability space, p > 0. In other words, frequently [ = 
LP{(Q, A, P)}. For more details see, for instance, Riischendorf [2013] and 
the references therein. 

The most frequently used, or well known, risk measures in the insurance 
and financial industry are listed in next paragraph. It has to be noted that 
insurance and financial perspectives may differ in some aspects. Detailed 
comments on these differences are provided in Section 1.2. Our perspective 
is the actuarial one and, hence, the following definitions are aligned with 
this point of view. In fact, these definitions are basically taken from Denuit 
et al. [2005]. The reason of including these definitions is to avoid possible 
misunderstandings due to differences in names given to certain risk mea- 
sures, 


Definition 1.8 (Value at Risk). Let us consider a € (0,1). The function 


VaRa :T — R 
X — VaR (X) = inf {x | Fx (x) = a} 
is a risk measure called Value at Risk at confidence level a. If Fx is continuous 


and strictly increasing then VaRq(X) = F r (X), where Fy is the inverse of 
the distribution function of random variable X. 
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Definition 1.9 (Tail Value at Risk). Let us consider a € (0,1). The function 
TVaRy, : T — R 
1 1 
X — TVaR4(X) = —|/| VaR, (x)da 
1 =Q a 
is a risk measure called Tail Value at Risk at confidence level a. 


Definition 1.10 (Conditional Tail Expectation). Let us consider a € (0,1). 
The function 


CTE, :T —R 
X — CTEg(X) =E[X | X > VaRg(X)] 


is a risk measure called Conditional Tail Expectation at confidence level a. 


Definition 1.11 (Conditional Value at Risk). Let us consider a € (0,1). The 
function 


CVaRg : T — R 
X — CVaRa(X) = E [X - VaRa (X) | X > VaRa(X)] 
= CTE, (X) — VaRg (X) 


is a risk measure called Conditional Value at Risk at confidence level a. 
Definition 1.12 (Expected Shortfall). Let be a € (0,1). The function 


ES« : T — R 
X — ESq(X) =E[(X -VaRq(X)),] 


is a risk measure called Expected Shortfall at confidence level a. Notation 
(t)+ is used to refer to the function that returns 0 if t < 0 and t otherwise. 


The following relationships between previous risk measures hold, as stated 
in Section 2.4 of Denuit et al. [2005]: 


TVaRq(X) = VaRa(X) + — -ESa(X), (1.4) 
1 
a = a Tar ee ae pie, "E a X ’ . 
CTEa(X) = VaRa (X) + opoo E (1.5) 
ESq(X) 
CVaRq(X) = (1.6) 


Sx (VaRq(X)) ` 
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Note that relationships (1.4) and (1.5) imply that, if the distribution func- 
tion of random variable X is continuous and strictly increasing then TVaRg 
(X) = CTE, (X) because 


Sx(VaRq(X)) = 1— Fx (VaRa(X)) = 1- Fx (Fz! (a))= 1- a. 


This is the reason of finding expressions like: ‘roughly speaking, the TVaR is 
understood as the mathematical expectation beyond VaR in this book. 


Example 1.1 (Illustrative exercise). Let us consider the following random 
variable X, that measures a loss, i.e. an economic value that can be lost 
with a certain probability, 


Xi —100 0 50 200 500 
pi = P(X= xi) 0.2 0.5 0.25 0.04 0.01 


a) Calculate VaRg (X), TVaRg(X) and CTE, (X) for a = 90% and for a = 
99%. 


= 


Explain if a loss X which is distributed like in the table presented here 
can produce a TVaR at the 90% level that is equal to 180. 


c) Find the value that must substitute 200 so that the results exactly corre- 
spond to ES90% (X) = 13, for a confidence level equal to 90%. Verify also 
that if we replace value 200 by 250 and value 500 by 550, then we obtain 
again the same results for a confidence level equal to 90%. 


= 


Based on the ideas in step c), explain why the value of the risk measures 
do not determine in a unique way the distribution of a random loss. 


Solution a) In order to make calculations easier, we complete the initial 
table with two additional rows. One corresponds to the distribution func- 
tion (cdf) of random variable X and the other is the corresponding survival 


function. 
Xi —100 0 50 200 500 
pi=P(X= xi) 0.2 0.5 0.25 0.04 0.01 
Fy (xj) 0.2 0.7 0.95 0.99 1 
Sx (xj) 0.8 0.3 0.05 0.01 0 
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We calculate the values of VaRgo% (X) and VaRgg% (X) using Definition 1.8 
VaRq(X)and the information displayed on the table. So, 

VaRg0% (X) = inf{x | Fx (x) > 90%} = 50, 

VaRg9% (X) = inf {x | Fx(x) = 99%} = 200. 
Both for the calculation of TVaR and CTE, we need to obtain the value of 


ES beforehand. Let us remind the definition of the latter for a loss random 
variable X and a confidence level a € (0, 1): 


ESq(X) = E[(X — VaRq(X))+]. 


Note that we need to consider Zx,q¢ = (X — VaRg(X))+, which is equal to 
zero when x; — VaRqg(X) < 0 and which is equal to x; — VaRa (X) when the 
difference is positive. Let us add two more lines to the table that has been 
used in this exercise, corresponding to values Zx,90% and Zx,99%: 


Xi —100 0 50 200 500 
pi=P(X=x;j) 0.2 05 0.25 0.04 0.01 
Fx (xj) 0.2 0.7 0.95 0.99 1 

Sx (xj) 0.8 0.3 0.05 0.01 0 

(x; — 50)+ 0 0 0 150 450 
(x; — 200)+ 0 0 0 0 300 


Therefore, 


5 
ES90% (X) = L (x; — 50)+ +p; = 150- 0.04 + 450 - 0.01 = 6 + 4.5 = 10.5, 
i=1 
5 
ES99%(X) = }_ (x; — 200), - p; = 300 -0.01 = 3. 
i=1 
Once the values for ES are obtained, then we can calculate TVaR and CTE 
using the following expressions: 


1 
TVaRq(X) = VaRg (X) + iva’ ESq(X) 
and 


1 
CTE (0) = VaRa (X) + ————— BS), 
dP TG 6) a 
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TVaRgox% (X) = 50 + (1/0.1)10.5 = 155, 
TVaRg9%(X) = 200 + (1/0.01)3 = 500; and 


CTEgo% (X) = 50 + (1/0.05)10.5 = 260, 
CTEg9% (X) = 200 + (1/0.01)3 = 500. 


b) The random loss X that is considered in this exercise cannot correspond 
to another loss if some values of the risk measures at the confidence level of 
90% are different to the risk measures obtained for the loss. For example, if 
the TVaR at the 90% level is 180 while we just saw that TVaR at the confi- 
dence level of 90% is 155 for the loss in this exercise, then the two random 
variables differ in their distribution. 


c) Let us fix the level of confidence to 90%. Let us note in that case that 
the source of the difference between the risk measures TVaR and CTE in 
two cases is in the value of ES9qy%,(X). For instance if the value is 13, while 
it is 10.5 in section a) of the current exercise. Then, when looking at the 
calculation of ESgq%(X), what needs to be done is to look at the following 
equation: 


(x4 — 50) -0.04 + 450 -0.01 = 13, with x4 > 50. 
Then, solving the previous equation, we obtain 
x4 = 25- [13 — 4.5 + 2] = 25- [10.5] = 262.5. 


Furthermore, ifwe change x4 = 200 by x4 = 262.5 we obtain the results that 
we were aiming at, namely, 


VaRg0% (X) = 50, ESoo% (X) = 13, CVaRg0% (X) = 260, 
TVaRgo%(X) = 180, and CTE90%(X) = 310. 


The variant proposed here is to consider now that x4 equals 250 and x5 
equals 550, and leaving all other x; as they were initially set. So, the value of 
ES90% (X) is calculated as 


(250 — 50), - 0.04 + (550 — 50), -0.01 = 200 - 0.04 + 500 - 0.01 
=8+5=13. 
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Therefore, with this change, we obtain 


VaRg0% (X) = 50, ESoo% (X) = 13, CVaRgom% (X) = 260, 
TVaRoox% (X)=180 and CTE90% (X)=310. 


d) In the previous paragraph, we deduce that at least, there are two random 
losses that have the same values for 


VaRg0%(X), ES90%(X), CVaRgo%(X), 
TVaRgom (X) and CTEg09% (X). 


As a consequence, we have just seen that the values of the risk measures do 
not determine in a unique fashion the cumulative probability function for a 
random variable. 


1.1.2 Properties for risk measures 


A list of properties that a risk measure may or may not satisfy is presented 
herein. Most of these properties have an economic interpretation or, at 
least, a relationship with some features that practitioners (the ones who 
want to quantify risk) demand to the risk measure (the instrument to quan- 
tify risk). In order to summarize the properties and their interpretation, Ta- 
ble 1.2 is provided. 


Table 1.2 Properties for risk measures 


Property Idea behind the property 
Translation If a positive non random quantity c is added to 
invariance random loss X then it is required to the risk mea- 


p(X +c) = p(X) +c, Sure that the risk value of the new loss should be 

YceR increased by the same quantity. Otherwise, if the 
quantity c is negative (so a protection buffer has 
been added to the original random loss X) then 
the risk measure should reflect this buffer as a net 
effect on the original risk value. 


Subadditivity If a risk measure satisfies this property then it is 
p(X, + Xo) < able to quantitatively reflect the idea that diver- 
p(X1) + p(X2) sification is a strategy that does not increase risk. 


Continued on next page 
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Table 1.2: continued from previous page 


Property 


Idea behind the property 


Monotonicity 
P(X, < X2)=1> 
P(X1) < p(X2) 


If losses of a position are almost surely worse than 
losses of another position, then the risk value of 
the former should be greater than the risk value 
of the latter. 


Positive 
homogeneity 
p(c-X)=c- p(X), 
Vc>0 


If losses to which the risk manager is exposed are 
multiples of a particular loss, then it is required 
that the risk measure of the overall risk should be 
the same multiple of the risk value of that partic- 


ular loss. 
Comonotonic Informally, two random variables are comono- 
additivity tonic if they are linked to another random vari- 
X; and X> able that drives their behavior. This property 
cómonotonit = is intended to identify those risk measures that 
p(X, + X2) = take into account this underlying relationship be- 
p(X1) + p(X2) tween comonotonic random variables and, as a 
consequence, they do not assign quantitative di- 
versification benefits when considering the sum 
of those random variables. 
Convexity This is a sort of generalization of the subaddi- 
p(A-X1 + (1—A)- Xe) tivity property. If the risk figure of any linear 
<A- p(Xı)+ combination of two random variables is smaller 
ApG, than the assouated linear combination ar risk fig- 
ures, then the risk measure captures diversifica- 
YAE (0,1) 


tion benefits in a continuous way. Note that if 
the risk measure is convex and positively homo- 
geneous and considering X; = 2- X; and À = 1/2, 
then the subadditivity property for X;, i = 1, 2 is 
obtained. 


Continued on next page 
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Table 1.2: continued from previous page 


Property Idea behind the property 

Law invariance If two random variables have identical distribu- 
(objectivity) tion functions then it is required to the risk mea- 
If P(X, <x)= sure that their risk values should be identical too. 


P(X» <x), VxeER 
then p(X1) = p(X2) 


Relevance If a random loss is not zero then its risk value 
IfX>0andX#0 should be strictly positive. 
then p(X) >0 


Strictness This property is intended to detect those risk 

p(X) > E(X) measures that are conservative enough to be used 
as a management tool, in other words, risk values 
based in risk measures that satisfy this property 
are always greater that the expected loss. 


For any random variables Xj, X2, X ET. 


Financial and actuarial literature are plenty of interesting proposals of risk 
measures. Details on some of these proposals are provided in Chapters 3 
and 4 and, in addition, several other references are pointed out therein. 


1.2 Risk measurement - Practice 


Let us start this section with Table 1.3, in which closed-form expressions are 
provided for VaR and TVaR where random variable X is distributed as a Nor- 
mal (^), a Lognormal (Z MN) and a Generalized Pareto (4) distribution. 
Notation conventions are used. Namely, h and © stand for the standard Nor- 
mal pdf and cdf, respectively. The standard Normal distribution œ-quantile 
is denoted as qq = l(a). For the GF distribution, the definition pro- 
vided in Hosking and Wallis [1987] is considered, where the scale parameter 
is denoted by o and k is the shape parameter. The 4 distribution con- 
tains the Uniform (k = 1), the Exponential (k = 0), the Pareto (k < 0) and 
the type II Pareto (k > 0) distributions as special cases. Table 1.3 is basically 
taken from Sandström [201]. 
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Table 1.3 Analytical closed-form expressions of VaR and TVaR for selected random 
variables 


Random variable a Expression 
measure 

Normal distribution VaRa H+O° da 

2 (qa) 
Ge) TVaRg H+ O-5—— 

-a 
Lognormal VaR a exp(Ul+ 0 - qa) 
ap miburon a2) Do- qa) 
LN (u,0*) TVaRq exp| H+ |]: —__ 
2 l-a 
Generalized Pareto VaRq —~o-In(l—a) 
distribution 
GPF(0,o) TVaRq o-[1-In(1-a)] 
(Exponential distribution) 
Generalized Pareto VaRq g iG a)*] 
distribution k 
GPQDk,o) with +00 if k<-1 
k<0 RaR Tii a jas k-(1—a)* 
— — — a — — 
aM k k| k+l 
if ke (-1,0) 


1.2.1 ‘Liability side’ versus ‘asset side’ perspectives 


No matter if you come from the insurance or from the financial industry: 
in both cases you agree on thinking on risk in terms of random losses. Dif- 
ferences arise when quantifying risk in practice, because usually an actuary 
works with random variables in which positive values identify losses and, 
therefore, she is worried about what happens in the right tail of the dis- 
tributions. Nonetheless, a practitioner from the financial industry usually 
works with random variables where positive values identify gains or profits, 
so she is mainly worried about the behavior of the left tail of the distribu- 
tions. Therefore, depending on where you come from, you would be used to 
look at risk quantification from different perspectives. More precisely, we 
should talk about ‘liability side’ practitioners and ‘asset side’ practitioners 
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instead of ‘insurance’ and ‘financial’ practitioners. For instance, an example 
of financial practitioners that take (what we have called) a ‘liability side’ per- 
spective when quantifying risk are those in charge of assessing credit risk. 
On the other side, as we will discuss later, the perspective used in European 
insurance regulation to quantify solvency capital requirements is an ‘asset 
side’ perspective and not a ‘liability side’ perspective (as it could be expected 
because of the nature of this industry’s business). 

Although moving from one perspective to the other is not a big issue, few 
guidelines to reach this goal are outlined. It is our opinion that these are the 
kind of helpful indications that bridge the gap between theory and prac- 
tice, and between insurance (‘liability side’) and financial (‘asset side’) prac- 
titioners. The following guidelines are summarized in Table 1.4, in order to 
provide a fast and visual reference when needed. 


Table 1.4 Risk quantification: ‘liability side’ versus ‘asset side’ perspectives 


Liability side Asset side 
Concept . F 
perspective perspective 
Notation for risk 
measures used in p r 
this Table 
we ae X a random loss X a random profit 
variable 
Monotonicit A o POQ= ay) = l= 
y (Xi) < p(X) 1(X1) > r(X2) 
From the liability side perspective, smaller losses should 
have associated smaller risk measurements. On the as- 
set side perspective, the higher the gain the lesser its risk 
value. 
Translation p(X+c)=p(X) +e, r(X+c)=r(X)-c, 
invariance VceR VYceR 


Continued on next page 
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Table 1.4: continued from previous page 


Concept 


Liability side Asset side 
perspective perspective 


A positive amount of money from the liability side per- 
spective may be considered as a loss, while from the as- 
set side perspective it is exactly the opposite. Therefore, 
if the risk measure satisfies the translation invariance 
property, a positive amount of money must increase risk 
from the liability side perspective while the same positive 
amount of money must decrease risk from the asset side 
perspective. 


Relevance 


X20and X 40> X<Oand xX 40> 
p(X) >0 r(x) >0 


Strictness 


p(X) > E(X) r(X) > -E(X) 


Recalling that X represents a random loss from the 
liability side perspective and a gain from the asset 
side perspective. 


Subadditivity, 
Positive 
homogeneity, 
Comonotonic 
additivity, 
Convexity, Law 
invariance 


Formal expressions from both perspectives remain 
as they are displayed in Table 1.2, except for replacing 
p byr. 


For any random variables X1, X2, X ET. 


Additional comments with respect to differences among the ‘liability side’ 


and the ‘asset side’ perspective for risk quantification may be found, for in- 


stance, in Riischendorf [2013]. As an example, Definition 1.8 has been intro- 


duced from a ‘liability side’ perspective, so positive values of random vari- 


able X are considered losses. Considering expressions in Definition 1.8 and 
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adopting an ‘asset side’ perspective, if one is interested in obtaining the VaR 
at a confidence level for a continuous random variable Z with positive val- 
ues representing profits, then the correct risk figure would be obtained as 


‘VaR of Z at confidence level a € (0, 1)’ 


= —VaR\_q(Z) following Definition 1.8. (1.7) 


The perspective taken in the following chapters of this book is the one that 
we have called ‘liability side’ perspective. 


1.2.2 Some misunderstandings to be avoided in practice 
Risk measures versus their estimates 


It is quite frequent to confuse a risk measure with the procedures used to 
estimate it. These two concepts are different and their identification can 
lead to misunderstandings. Fortunately, the spread of knowledge about risk 
measurement makes these kind of doubts less frequent than they were be- 
fore. But when having first contact with risk measurement (for instance, if 
you are an undergraduate student interested in this topic or a recently hired 
practitioner without previous experience in the insurance industry or the 
financial sector) this is one of the most common mistakes. Diagram in Fig- 
ure 1.2 may help to clarify concepts. 


Figure 1.2 Basic mind map for risk quantification. 


Assumptions 


Theory Practice 


To ô = 
Risk measure (p) LP, | Risk figure est. P(X) 


| 


R.v. estimation (x ) 


Random variable (X) ae 


Figure 1.2 is intended to depict a schematic situation faced when trying to 
quantify risk. On the one hand, theoretical aspects related to the risk mea- 
sure (the instrument to summarize risk) and the target random variable (the 
source of risk) must be taken into account. These theoretical aspects are 
represented on the left hand side of the diagram, and should correspond 
to answers to questions such as the following: Is the selected risk measure 
adequate? Is the target random variable observable?...On the other hand, 
figures are basic in practice. As long as the final objective is to obtain an 
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estimate of the incurred risk (framed box in Figure 1.2) assumptions have to 
be in place to move from theory to practice. So, the assumptions made to 
estimate both the risk measure and the target random variable become cru- 
cial. They are so relevant that, from our point of view, they can lead to the 
confusion that we are highlighting here. This is because, in daily practice, 
one could deliver risk figures estimations (right hand side of the diagram) 
without worrying about theoretical aspects (left hand side). As mentioned 
before, let us put some examples. 


Example 1.2 (Historical VaR). Measuring risk in practice using the histor- 
ical VaR methodology has been relatively common because it has an easy 
implementation. Properly speaking, it is not a unique methodology as we 
try to justify hereinafter. From the point of view provided by the diagram in 
Figure 1.2, on the theoretical side this methodology takes into account as risk 
measure p the VaR with some confidence level a € (0, 1) and considers that 
the target random variable X is observable. Moreover, it is assumed that ob- 
servations of that random variable from past periods can be obtained. The 
assumptions for moving from theory to practice are as follows: with respect 
to the estimation of the target random variable X, it is assumed that future 
realizations will be exactly the same as past realizations, so past observa- 
tions that have been obtained are going to be considered future observa- 
tions too. And with respect to the estimation f of VaR, there is not a unique 
feasible assumption (and this is why we consider the ‘historical VaR’ a set 
of methodologies and not just one). For instance, a feasible assumption is 
to consider the data set of observations of X as it represents the discrete 
random variable X which only takes those particular values and no more. 
Consequently, VaR should be estimated as the empirical a-quantile of that 
set. But, if the data set of observations of X is considered just a sample of X, 
then any a@-quantile approximation! of data set X could be used to obtain 
the final risk figure estimation 0 (X) of p(x). 


Example 1.3 (Normal VaR). Bearing in mind diagram in Figure 1.2, this me- 
thodology takes as theoretical risk measure p the VaR at some confidence 
level a € (0, 1), and considers as target random variable X one which is as- 
sumed to be normally distributed. Assumptions to move from the theoreti- 
cal side to the practical one are as follows: with respect to X, it is assumed 


1 For instance, quantile function in software R has more than 10 different ways to approxi- 
mate the a-quantile, where the one coded by 0 is what we have called the empirical quan- 
tile. Even MS Excel has implemented functions INC.PERCENTILE and EXC.PERCENTILE 
which return different approximations of the a-quantile. 
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that X ~ N(u, o°) forsome u € Rando > 0, and that the practitioner is able 

to estimate u and g in some way (maybe from data or from expert judgment, 

for instance), so it is feasible to obtain [i and G estimates of u and øg, respec- 
tively. With respect to the risk measure, the assumption made on the ran- 
dom variable implicitly provides a closed-form expression for VaR, because 

if X ~ N(u, 0°) then VaRa (X) = U +0: qa, where qq is the a-quantile of a 

standard normal distribution (as it has been shown in Table 1.3). As it hap- 

pened with the historical VaR methodology, the Normal VaR methodology 
may be understood as a set of methodologies depending on the particular 

chosen way for estimating the parameters of the distribution. In the end, 

p(X) is estimated by i+ 6: qa. 

Note that the Normal VaR methodology is frequently used for sums of nor- 

mally distributed random variables. On the theoretical side, ifn > 1 random 

variables X; ~ N(ui, 04); i = 1,...,,are considered and A = (Pi j)i,je{(1,...,n} 
is the correlation matrix for pairs of those random variables, then it is known 

that 


where o” = ji’ - A- ji and [iis an n-dimensional vector whose components 
are Hi, Í = 1,..., N. So, the situation is just the one described in the previous 
paragraph, taking as u = } ¢_} Hi and as ø = \/pl’- A-{i. In this case, the 
process to obtain parameter estimates fì and @ must take into account that 
correlation coefficients p;; should also be estimated. In other words, 


N A 
> > 


G=\p-A-p. 
Example 1.4 (Cornish-Fisher VaR). As in the previous examples, different 
methodologies are embraced under this name. They share the following ele- 
ments: on the one hand, the theoretical risk measure p is the VaR with some 
confidence level «æ € (0,1) and no hypothesis about the distribution func- 
tion of the target random variable is made. Nonetheless, it is assumed that 
some higher order moments of X exist and are finite. On the other hand, as- 
sumptions for moving from the theoretical side to the practical side are that, 
in order to obtain an estimation (X), a closed-form approximation simi- 
lar to the one valid for normally distributed random variables is achievable. 
For that purpose, modified a-quantiles are devised taking into account esti- 
mations of finite order moments of X. Differences between Cornish-Fisher 
VaR methodologies come from the maximum order of moments considered 
in the quantile estimations. For instance, in Chapter 4 we have used third 
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order Cornish-Fisher VaR approximations, but is is usual to find fourth order 
Cornish-Fisher VaR approximations in financial applications. 


VaR versus Mean-VaR 


An apparently harmless sentence like ‘most financial credit risk models used 
in practice to quantify risk are based on VaR at some confidence level’, which 
most practitioners and researchers in this field may subscribe, can have un- 
desired consequences if it is misunderstood. The main concern with the 
previous sentence is that nothing is said about the random variable to which 
the VaR is applied to: even considering the same confidence level and the 
same input data, different figures can be obtained depending on the under- 
lying random variable under inspection. For instance, a large number of 
banks use internal models to simulate losses generated by credit events af- 
fecting their loans. Let us focus on one bank and let us denote its aggregate 
simulated losses by X. Therefore, the amount of money needed to cover 
unexpected losses (its economic capital) is probably computed as 


EC= VaRo9.9% (X = E(X)) 


in order to take into account its simulated values and also regulatory require- 
ments (Basel I/II). Note that in this case, although the random variable 
simulated is X, the one used to quantify risk (i.e., to obtain the economic 
capital) is U = X — E(X), in fact. The VaR is a risk measure that satisfies the 
translation invariance property shown in Table 1.2 and, therefore, 


EC = VaRgg.9%(U) = VaRg9.9%(X) — E(X). (1.9) 


This last expression for the EC is certainly more familiar to financial practi- 


tioners. Moreover, sometimes p(X) = VaRg9,9%(X) —E(X) is considered the 
value that another risk measure p named ‘Mean Value at Risk’(Mean-VaR) 
returns when applied to random loss X. Expression (1.9) has been inten- 
tionally displayed in second place in order to stress the following idea. Let 
us imagine now an European insurance company calculating its Solvency 
Capital Requirement (SCR) under the Solvency II regulatory framework and 
by using an internal model. Let us suppose that within the model a set of 
stochastic basic own funds of the company for the next year is simulated. 
In such a case, if Y denotes the ‘basic own funds for the next year’ random 
variable, then taking into account expression (1.7) it seems reasonable that 
the following expression 


SCR= VaRo9.5% (- Y) = —VaRo 5% (Y) (1.10) 
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would be used to compute the SCR, because it perfectly fits the regulatory 
requirements”. But what it is relevant here is that it makes no sense to re- 
quire the company to set aside, as a cushion against insolvency, the follow- 
ing amount of money 


SCR = VaRg9.5%(— Y) — E(— Y) = VaRg9.5%(— Y ) + E(Y). (1.11) 


Due to misunderstanding of expression (1.9) for the EC, and transposing it 
for the SCR expression simply replacing X by —Y, figures with non eco- 
nomic sense are attained. Why? Basically because X and — Y are essentially 
different. Random variable X is a pure loss while — Y contains both losses 


and gains. In fact, hopefully E(— Y) « 0 (the insurance company expects 


substantial gains) and reasonably E(X) > 0 (the expectation ofa set of losses 
is also a loss). In words, when computing the EC the focus is set on random 
variable U = X — E(X) because it is assumed that the quantity E(X) is al- 
ready accounted for on the liability side of the balance sheet (which is not 
entirely simulated by the credit risk model) to mitigate credit losses. On the 
other hand, the model for the SCR of the insurance company is simulating 
the whole balance sheet. Therefore — Y is not comparable with X because 
losses associated to — Y are those that have exceeded all the mitigation tools 
and strategies that the company has in place, while X losses are computed 
gross of any mitigation effect. 


Example 1.5. A toy example can help us to illustrate the impact of such a 
misunderstanding. Imagine two insurance companies cı and C2, one with 
Yı, t = 100 monetary units (m.u.) of present basic own funds and the other 
with Y>;= 1 m.u. Both use the same model to project next year basic own 
funds (let us say Yj,¢+1 and Y2,+41) and the same methodology to compute 
VaR at the 99.5% confidence level. To simplify things, let us assume that 
E(Y; t+1) = Y; t for i = 1, 2, so the expectation of projected basic owns funds 


for the next year is nothing but the value of the actual basic own funds of 
each company. Imagine that the risk figures that these companies obtain 
are VaRo9.5% (- Yi t+1) = 5 and VaRg9.5% (- Yo, t+1) = 0.5. They may be in- 
terpreted in the following way: cı is going to suffer a minimum loss of a 5% 
of its present basic own funds in a 0.5% of the future scenarios considered, 
while c3 is going to suffer a minimum loss of a 50% of its present basic own 
funds in a 0.5% of the future scenarios considered. Interpreted in that way, 


? As it is shown with this expression, the core of the European insurance regulation uses 
what we have called an ‘asset side’ perspective when talking about risk quantification. 
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c2 seems highly riskier than cı. And this would properly be reflected us- 
ing expression (1.10), because their respective solvency capital requirements 
will be SCR(c)) = 5 m.u. and SCR(c2) = 0.5 m.u. which, in terms of their 
present basic own funds, represent reasonable risk proportions. But note 
that if misunderstandings are in place and expression (1.11) is used instead 
of expression (1.10) to compute their SCR, then SCR(c)) = 5+ 100 = 105 
m.u. and SCR(c2) = 0.5+ 1 = 1.5 m.u. are obtained. These figures are far 
from representing neither the risk faced by the companies nor their relative 
riskiness. 


Somebody could think that the previous examples overweight the impor- 
tance of items on the right hand side of Figure 1.2. These examples have 
been chosen because they correspond to common risk quantification issues 
found in practice and researchers must bear them in mind. Nevertheless, it 
is also our intention to aware that practitioners should spend some time on 
thinking of questions related to the left hand side of that Figure, this is, on 
theoretical aspects related to a practical risk quantification in a regular basis. 
Some of these questions are listed below, although it is neither an extensive 
list nor a prioritized one: 


- Have several risk measures been considered before the final selection is 
made? 

: Do these risk measures satisfy properties that we consider necessary? 

: Are these risk measures or their confidence levels regulatory driven? 


- Have we an idea about the implicit risk attitude behind using those par- 
ticular risk measures? 


* What are we looking for as the final result of this risk quantification pro- 
cess? 


- Are we aware about our capability (in terms of time, resources and knowl- 
edge) to transform ideas into numbers? In other words, for every consid- 
ered risk measure and every target random variable, do we know how to 
move from the theoretical side to the practical side? 


- Have we properly defined our target random variable? 


* Does the target random variable depend on other random variables easier 
to measure or identify? 


- How precise do we need to be in our estimations? 
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Hopefully, useful ideas about how to answer some of these question may be 
found in this book or in the references therein. We would like to close this 
chapter with some last remarks. As it has already been said, main references 
used to build this chapter are books Denuit et al. [2005] and Riischendorf 
[2013]. Note that the CTE risk measure introduced in Definition 1.10 is called 
Expected Shortfall (ES) in McNeil et al. [2005] and, therefore, there is also 
a difference with the Definition 1.12 of ES provided in this book. Moreover, 
names for several risk measures in Section 1.1 do not match the ones used for 
equivalent risk measures in Riischendorf [2013]. This remark makes evident 
that there is yet no common consensus in risk measures naming. 

For an interesting way to study basic risk measures but without a parametric 
model assumption, the work by Alemany et al. [2013] shows how to imple- 
ment kernel estimation of the probability density function and how to de- 
rive the risk measure from there. Kernel estimation is specially useful when 
the number of observations is large. Bolancé et al. [2003]; Buch-Larsen et al. 
[2005]; Bolancé et al. [2008] explain how to address heavy-tailed or skewed 
distributions. The interested reader can find several contributions using 
other models and non-parametric approaches in Bolance et al. [2008]; Guillen 
et al. [2011, 2013]. Bolancé et al. [2012, 2013] provide data-driven examples 
with R and SAS code in the context of operational risk problems. Multivari- 
ate risk quantification is addressed by Bolancé et al. [2014]; Bahraoui et al. 
[2014]. 

With respect to a deeper analysis of issues of Solvency II for practitioners 
and regarding theoretical aspects behind Cornish-Fisher expansions, the in- 
terested reader is referred to Sandström [2011]. Last but not least, one topic 
not covered by this book that has to be taken into account in risk quantifi- 
cation is the model risk. Aggarwal et al. [2016] provides a wide variety of 
approaches to deal with this real challenge and may be an interesting de- 
parture point to anyone interested in this topic. 


1.3 Exercises 


1. Consider the following empirical distribution 
13, 15, 26, 26, 26, 37, 37, 100 


Determine the VaRgs% (X) and TVaRgs~ (X). 


2 
x 
2. Consider the following distribution function F(x) = y for0< x <3. 


Find the VaRg5%(X) and TVaRg5% (X). 
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3. Given that 
VaRgom (X) = 50, ESg90% (X) =13 and CVaRgo% (X) = 260. 


a) Calculate TVaRgo% (X), Sx(VaRg90%(X)) and CTE90% (X). 


b) Discuss if itis possible that loss X would be an absolutely continuous 
random variable. 


4. Show that the TVaR of a random variable X distributed by the Normal 
ee ee i o(®*(@)) 
distribution V (u, o^) is equal to TVaRg = ut+oa- apes where 
a 


) and ©! stand for the standard Normal pdf and quantile function, re- 
spectively. 


a) Demonstrate that the properties of Translation invariance, Positive ho- 
mogeneity and Strictness are satisfied in this case. 
b) Repeat the exercise for the CVaRq. 


5. Analyze if the properties of Translation invariance, Positive homogeneity 
and Strictness are satisfied by the VaR and TVaR when: 


a) the random variable X is distributed by the Lognormal distribution 
LN (u, 0°). 


b) the random variable X is distributed by the Generalized Pareto dis- 
tribution 422 (0,0). 


2 Data on losses for risk 
evaluation 


Historical loss data is of key importance for risk management and model- 
ing of losses. This statistical information must be carefully analyzed and 
understood in order to extract the best possible knowledge. Therefore, it is 
important to collect details on the data sources, such as framework, time of 
collection, definitions, exceptions and so on. 

Insurance companies have always been collecting data for risk management 
purposes, but in the past information was rather aggregate whereas nowa- 
days, there is much more detailed knowledge on individual policy holders, 
events, claims handling and loss compensations. Nevertheless, data quality 
still remains a challenge for many insurers and the lack of robustness of sta- 
tistical information may sometimes pose huge problems to the undertaking 
of ambitious risk management initiatives. 

This chapter presents examples of loss data, which are used throughout some 
parts of this book. They are intended to be helpful to understand concepts. 
They do not intend to represent any particular insurance company. 
Thereafter, we continue by discussing several characteristics of the loss sever- 
ity information that has been specially created for this book and their aggre- 
gation by means of their sum. This analysis involves statistical description in 
this chapter and, in the following chapters, we address the implementation 
of risk measurement and capital allocation. Having a good understanding 
of amounts of losses is important not only for modeling purposes, but also 
to identify effective management and mitigating actions. 


21 An example on three dimensional data 


Historical loss information is the main source of knowledge for the specific 
experience and history of the organization. Sources of information on loss 
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events that occurred in comparable companies can sometimes be helpful to 
understand their position with respect to exposure and their corresponding 
risk. It is sometimes also useful to compare lines of business or business 
units within a single company. 

Throughout most chapters of this book the same data set will be examined. 
The data consist of three sources of loss and the sum of them. We make use 
of an artificial sample to provide with a guided path to implementation. Be- 
cause of the loss amounts have been artificially generated they do not have 
monetary dimension other than ‘monetary units’. Nevertheless, in several 
parts of the book we consider that these loss amounts are the cost of claims 
in thousands of euros for three types of damage, let us say property damage, 
bodily injuries and medical expenses. Table 2.1 and Table 2.2 show some sta- 
tistical characteristics of the sample, and Figure 2.1 presents a visual view of 
the three variables and their sum. 

There are some interesting features about the example data that are pre- 
sented in this chapter. All the variables are asymmetric with a few large 
values and many small values. The maximum values for both X; and X2 
are much higher than the average. The minimum values are always strictly 
positive. We also note that the minimum of the sum is not the sum of the 
minimum and this is due to the fact that the data are paired, and the min- 
imum of each marginal does not necessarily correspond to the minimum 
observed value of the other marginal. 


When comparing the data on Figure 2.1, we note that the size of the losses 
and the shape of the histograms are similar between sources X1 and X2 but 
not between them and source X3. Additionally, we cannot see the depen- 
dence patterns between the sources unless we look into the bivariate plots. 
The joint behavior is presented in Figure 2.2. 


Table 2.1 Statistical summary of the example loss data 


Data Mean Median Std. Deviation Maximum 
Xı 0.707 0.033 3.778 49.812 
X2 0.450 0.033 3.369 52.129 
X3 0.268 0.105 0.361 1.775 
Xi +X2+X3 1.426 0.221 5.6797 103.716 


There are 350 losses for each variable 
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Table 2.2 Statistical summary of the example loss data (part II) 
Data Minimum Variance Skewness Kurtosis 
Xı 0.001 14.273 8.875 94.353 
Xo 0.001 11.350 12.401 171.107 
X3 0.003 0.130 2.119 4.296 
X 1+ X24+ X3 0.026 46.199 11.354 153.368 


There are 350 losses for each variable 


Figure 2.1 Histograms of loss data originating from sources X1, X2, X3 and their 


sum 
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Basic graphical analysis of the loss severity 


distributions 


We can measure risk because different outcomes occur with different proba- 


bilities. Loss Distribution Analysis (LDA) is a method used to calculate and 
evaluate the risk that is induced by losses that occur randomly. Historical 
data can be helpful to predict future behavior, and so it is important to ana- 


lyze its density shape. 
Modeling the severity of losses usually involves analysis of parametric dis- 
tributions such as the Normal distribution and all other distributions that 
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Figure 2.2 Dependence from sources X1, X2, X3 and their sum 


Simple Scatterplot Matrix, Z2=X1+X2+X3 
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have been mentioned in the previous chapter. However, it is very difficult 
to find a parametric distribution that fits all sizes of loss: (i) low severities 
occur frequently, (ii) high severities occur much less often, and (iii) catas- 
trophic losses only occur very rarely. They are usually referred as rare losses 
or rare events. 

This section presents the analysis of one parametric distribution that can 
be used to fit the behavior observed in the sample data. Some other basic 
parametric distributions for severity not considered here were presented in 
the previous chapter. 

We have estimated the Normal densities using maximum likelihood in all 
variables, X1, X2, X3 and the sum of the three. Some results are shown in 
Figures 2.3 to 2.5. In there, we present the parametric Normal pdf estimate 
and the original data, the former appearing as dashed lines. On the left, the 
plots show the lower values of the observed sample and on the right the 
higher observed values are presented. Low and high values are split in order 
to see the parts of the densities where data are scarce. Only a simple para- 
metric density estimate is presented in this chapter, but practitioners often 
use more sophisticated distributions. The empirical distribution function 
obtained for the sample data is also a possibility, and in this case it is not 
necessary to fit any distribution in particular. 

Figures 2.3 to 2.5 show the maximun likelihood estimation of the Normal 
densities for the data. The right hand side tail is presented. The density 
may still be too low for large losses. 
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Figure 2.3 The estimated density for the X; data using the Normal distribution 


X1 - Normal curve over histogram X1 - Normal curve over histogram (large losses) 
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Figure 2.4 The estimated density for the X2 data using the Normal distribution 


X2 - Normal curve over histogram X2 - Normal curve over histogram (large losses) 
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Figure 2.5 The estimated density for the X3 data using the Normal distribution 
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2.3 Quantile estimation 


Quantile estimation for the parametric distribution presented in the previ- 
ous section, other distributions or the empirical distribution functions can 
be performed using the inverse of the distribution function evaluated at the 
maximun likelihood parameter estimates. This is one of the most basic risk 
assessment tools that will be used throughout this book and, if one think 
about Figure 1.2, this is the sort of tools necessary to move from theory to 
practice. 


2.4 Examples 


Illustrative examples of quantitative analysis are developed in the program- 
ming language R. The dataset used in this Chapter (exampleERM) is avail- 
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able at http: //www.ub.edu/rfa/ERM. 


1. Download and save the data in an appropriate directory. Prepare the 
data: 
example < — read.table(”exampleERM.txt”, header=TRUE) 
riskloss < — as.matrix(example) 


2. Descriptive analysis in R. The number of observations and a set of de- 
scriptives can be computed as follows: 
n < — nrow(riskloss) 
summary(riskloss) 
E.di < — colMeans(val.di) 
Cov.di < — var(val.di) 


library(moments) 
skewness(riskloss[ ,”X1”]) 
kurtosis(riskloss[ ,”X1”]) 


3. Graphical analysis in R. Compute the histogram of X1 and scatterplots as 
follows: 
hist(riskloss[,”X1”], 20) 
lines(density(riskloss[,”X1”]),lwd=3) 


plot(riskloss[ ,”X1”], type="1", ylim=c(0,10)) 
lines(riskloss[ ,”"X2”], Ity=2) 
lines(riskloss[ ,”"X3”], lty=3) 


4. Compute the VaRgo% (X 1) and CTEgo% (X 1) 
riski < — riskloss[,1] 
Eriski < — mean(risk1) 
Vriskı < — var(risk1) 


VaRgo < — quantile(riski,0.90) 
CTEgo < — mean(riski[riski>VaRgo]) 


3 A family of distortion risk 
measures 


Value at Risk (VaR) has been adopted as a standard tool to assess the risk and 
to calculate capital requirements in the insurance industry. As it has been 
shown in Chapter 1, VaR at level @ is the a-quantile of a random variable X 
(which is often called loss). Recalling Definition 1.8 


VaRg (X) = inf{x | Fx (x) = a}= Fz" (a), 


where Fy is the cumulative distribution function (cdf) of X and g is the con- 
fidence or the tolerance level 0 < æ < 1. However, VaR is known to present 
a number of pitfalls when applied in practice. A disadvantage when using 
VaR in the insurance or financial contexts is that the capital requirements 
for catastrophic losses based on this measure can be underestimated, i.e. the 
necessary reserves in adverse scenarios may well be less than they should be. 
The underestimation of capital requirements may be aggravated when fat- 
tailed losses are incorrectly modeled by mild-tailed distributions, such as 
the Normal distribution. There are attempts to overcome this kind of model 
risk when using VaR or, at least, to quantify the risk related to the modelling 
[Alexander and Sarabia, 2012]. But, in addition, a second drawback is that 
the VaR may fail the subadditivity property. As it has been shown in Ta- 
ble 1.2 of Chapter 1, a risk measure is subadditive when the aggregated risk 
is less than or equal to the sum of individual risks. Subadditvity is an ap- 
pealing property when aggregating risks in order to preserve the benefits of 
diversification. VaR is subadditive for elliptically distributed losses [McNeil 
et al., 2005]. However, the subadditivity of VaR is not granted, as indicated 
in Artzner et al. [1999] and Acerbi and Tasche [2002]. 

Remember now Definition 1.9 from Chapter 1. In there Tail Value at Risk 
(TVaR) has been defined as 


1 1 
TVaRq(X) = =S VaR, (X)dA. 
l-a a 
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Table 3.1 VaRg5% and TVaRg5y, illustration 


VaRg5% TVaRg 5% 


Empirical 47.6 125.5 
Normal 87.0 105.9 
Lognormal 48.9 119.1 


* Cost of claims in thousands of Euro 


Roughly speaking, the TVaR is understood as the mathematical expectation 
beyond VaR. The TVaR risk measure does not suffer the two drawbacks dis- 
cussed above for VaR and, as such, would appear to be a more powerful mea- 
sure for assessing the actual risks faced by insurance companies and finan- 
cial institutions. However, TVaR has not been widely accepted by practition- 
ersin the financial and insurance industry. VaR is currently the risk measure 
contemplated in the European solvency regulation for the insurance sector 
(Solvency II), and this is also the case of solvency regulation for the bank- 
ing sector (Basel accords’). The TVaR measures average losses in the most 
adverse cases rather than just the minimum adverse loss, as the VaR does. 
Therefore, capital reserves based on the TVaR have to be considerably higher 
than those based on VaR and significant differences in the size of capital re- 
serves can be obtained depending on which risk measure is adopted. 

An illustration of the risk value obtained for the VaRgs5% and TVaRg5% con- 
sidering three alternative distributions is provided in Table 3.1. Note that 
huge differences in risk amounts can be obtained. 

This chapter is motivated by the following question. Can a risk measure be 
devised that would provide a risk assessment that lies somewhere between 
those offered by the VaR and the TVaR? To this end, a new family of risk 
measures (GlueVaR) is proposed, which forms part of a wider class referred 
to as distortion risk measures. 

GlueVaR risk measures are defined by means of a four-parameter function. 
By calibrating the parameters, GlueVaR risk measures can be matched to a 
wide variety of contexts. Specifically, once a confidence level has been fixed, 
the new family contains risk measures that lie between those of VaR and 


1 Although it seems that changing VaR by TVaR with a lower confidence level is something 
that is really under consideration for regulatory capital requirements in the Banking sec- 
tor. 
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TVaR and which may adequately reflect the risk of mild-tailed distributed 
losses without having to resort to VaR. In certain situations, however, even 
more conservative risk measures than TVaR may be preferred. It is shown 
that these highly conservative risk measures can also be defined by means 
of the GlueVaR family. In order to preserve the benefits of diversification 
when aggregating risks, subadditivity is an appealing property ofa risk mea- 
sure. As it has been shown in Chapter 1, the subadditivity property ensures 
that the risk measure value of the aggregated risk is lower than or equal to 
the sum of individual risk measure values. In this chapter the subadditivity 
property of GlueVaR risk measures is investigated. 


3.1 Overview on risk measures 


Two main groups of axiom-based risk measures are coherent risk measures, 
as stated by Artzner et al. [1999], and distortion risk measures, as introduced 
by Wang [1996]. Concavity of the distortion function is the key element to 
define risk measures that belong to both groups [Wang and Dhaene, 1998]. 
Suggestions on new desirable properties for distortion risk measures are 
proposed in Balbas et al. [2009], while generalizations of this kind of risk 
measures can be found, among others, in Hiirlimann [2006] and Wu and 
Zhou [2006]. As shown in Goovaerts et al. [2012], it is possible to link dis- 
tortion risk measures with other interesting families of risk measures devel- 
oped in the literature. 

The axiomatic setting for risk measures has extensively been developed since 
seminal papers on coherent risk measures and distortion risk measures. Each 
set of axioms for risk measures corresponds to a particular behavior of deci- 
sion makers under risk, as it has been shown, for instance, in Bleichrodt and 
Eeckhoudt [2006] and Denuit et al. [2006]. Most often, articles on axiom- 
based risk measurement present the link to a theoretical foundation of hu- 
man behavior explicitly. For example, Wang [1996] shows the connection 
between distortion risk measures and Yaari’s dual theory of choice under 
risk; Goovaerts et al. [2010] investigate the additivity of risk measures in 
Quiggin’s rank-dependent utility theory; and Kaluszka and Krzeszowiec [2012] 
introduce the generalized Choquet integral premium principle and relate it 
to Kahneman and Tversky’s cumulative prospect theory. 

Many articles have appeared in recent years that pay attention to risk mea- 
sures based on distortion functions or on generalizations of the quantiles. 
An example of the first group is Zhu and Li [2012]. Bellini and Gianin [2012] 
and Bellini et al. [2014] fit to second group. An interplay between both groups 
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is found in Dhaene et al. [2012a] and Goovaerts et al. [2012]. 


3.2 Distortion risk measures 


Distortion risk measures were introduced by Wang [Wang, 1995, 1996] and 
they are closely related to the distortion expectation theory [Yaari, 1987]. A 
review on how risk measures can be interpreted from several perspectives 
is provided in Tsanakas and Desli [2005], and a clarifying explanation of the 
relationship between distortion risk measures and distortion expectation 
theory is included. A detailed literature review of distortion risk measures 
is available in [Denuit et al., 2005; Balbas et al., 2009]. There are two key 
elements to define a distortion risk measure: first, the associated distortion 
function; and, second, the concept of the Choquet [ Choquet, 1954] Integral. 
The distortion function, Choquet Integral and the distortion risk measure 
concepts can be defined as follows: 


- Distortion function. Let g : [0,1] — [0,1] be a function such that g(0) = 
0, g(1) = 1 and g is injective and non-decreasing. Then g is called a dis- 
tortion function. 


* Choquet Integral. The (asymmetric) Choquet Integral with respect to 
a set function u of a u-measurable function X : Q — R is denoted as 


f Xdgu andis equal to 


+00 


0 
[xau=[ [Sux -Max f Sp, x(x)dx, 


if u(Q) < oo, where Su, x(x) = u({X > x}) denotes the survival function 
of X with respect to u. Note that Q denotes a set, which in financial and 
insurance applications is the sample space of a probability space. A set 
function pin this context is a function defined from 2° (the set of all sub- 
sets of Q) to R. A -measurable function X is, widely speaking, a function 
defined on Q such that expressions like u({X > x}) or u({X < x}) make 
sense. See Denneberg [1994] for more details. 

- Distortion risk measure. Let g be a distortion function. Consider a ran- 
dom variable X and its survival function Sx (x) = P(X > x). Then, func- 
tion pg defined by pg(X) = f°. [g(Sx) -1] dx+ f$” g(Sx(x)dx 
is called a distortion risk measure. 


From the previous definitions, it is straightforward to see that for any ran- 
dom variable X, p g(X) is the Choquet Integral of X with respect to the set 
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function u = go P, where P is the probability function associated with the 
probability space in which X is defined. 

The mathematical expectation is a distortion risk measure whose distortion 
function is the identity function [Denuit et al., 2005], this is, pia (X) = E(X). 
Therefore, a straightforward way to interpret a distortion risk measure is 


as follows: first, the survival function of the random variable is distorted 
(go Sx); second, the mathematical expectation of the random variable with 
respect to this distorted probability is computed. From a theoretical point 
of view, note that this interpretation fits the discussion that risk may be de- 
fined as an expected value in many situations [Aven, 2012]. 

VaR and TVaR measures are in fact distortion risk measures. The associated 
distortion functions of these risk measures are shown in Table 3.2. 


Table 3.2 VaR and TVaR distortion functions 


Risk measure Distortion function 


0 if0<u<l-a 


VaR Yalu) = 
l1 ifl-a<us<l, 
u 
— if0<u<l-a 
TVaR Yalu) = l-a 
1 ifl-as<u<l 


For a confidence level a € (0,1). 


Based on the distortion functions shown in Table 3.2, once a@ is fixed it can 
be proved that VaRqg(X) < TVaRq(X)for any random variable X. 


Remark 3.1. Let g and g* be two distortion functions and let pg and p g* 
be their respective distortion risk measures. Suppose that g(u) < g* (u) for 
all u € [0,1]. Then pg(X) < pg* (X) for any random variable X. 

This result follows immediately from the definition of distortion risk mea- 
sures, because 


+00 


0 
p= | ig(Sx)— Na+ f g(Sx(x))dx 


0 +00 
<Í [g* (Sx(x))- Ula + | g* (Sx(x%))dx 


= pg (X). 
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Many articles have recently examined risk measures based on either distor- 
tion functions [Zhu and Li, 2012; Belles-Sampera et al., 2013a, 2014a, 2016b; 
Guillen et al., 2016; Tsanakas and Millossovich, 2016] or generalizations of 
the quantiles [Bellini and Gianin, 2012; Bellini et al., 2014]. The interplay be- 
tween both of these two groups of risk measures has been examined [Dhaene 
et al., 2012a; Goovaerts et al., 2012]. 


3.3 Anew family of risk measures: GlueVaR 


A new family of distortion risk measures, named GlueVaR, is here defined. 
Originally, we introduced this family in Belles-Sampera et al. [2014a]. The 
main reason for defining these GlueVaR risk measures is a response to the 
concerns expressed by risk managers regarding the choice of risk measures 
in the case of regulatory capital requirements. However, as it has been al- 
ready mentioned, an axiomatic approach to define or represent risk mea- 
sures is more frequent in the literature [Artzner et al., 1999; Follmer and 
Schied, 2002; Frittelli and Rosazza Gianin, 2002; Denuit et al., 2006; Song 
and Yan, 2009; Cerreia-Vioglio etal., 2011; Ekeland et al., 2012; Goovaerts et al., 
2012; Grechuk et al., 2012]. 

Any GlueVaR risk measure can be described by means of its distortion func- 
tion. Given a confidence level a, the distortion function for GlueVaR is: 


h 
i ik ifO<u<1- 6 
Isha (yy — ho —h 
Kga W= m+- YO-A if 1-B<u<i-a (31) 
1 ifl-asus<l 


where @, p € [0,1] such that a < p, hy € [0,1] and hg € [h;, 1]. Parameter 
Ê is the additional confidence level besides a. The shape of the GlueVaR 
distortion function is determined by the distorted survival probabilities hı 
and hz at levels 1 — 6 and 1 — a, respectively. We call parameters h; and h2 
the heights of the distortion function. 

A wide range of risk measures may be defined under this framework. Note 
that VaRq and TVaRg are particular cases of this new family of risk measures. 
Namely, VaRq and TVaRg correspond to distortion functions oe and is 
respectively. By establishing suitable conditions on the heights hı and ho, 
the GlueVaR family is very flexible. For example, risk managers might like 
to select a, p, hı and hz so that 


VaRq(X) < GlueVaR g” (X) < TVaRg (X). 
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This can be achieved by selecting a set of parameters for their associated 
distortion functions to ensure that Wa(u) < si (u) < Ya(U) for any u € 

1-B 


[0, 1], following remark 3.1, i.e. by forcing condition hı < — An example 

of such a case is shown in Figure 3.1 (left-hand side). 

The GlueVaR family also allows us to define a highly conservative risk mea- 
he such that 


sure GlueVaR 6 
TVaRa (X) < GlueVaR 5!” (X) < TVaRg(X) 
hı,h2 . r 
Bia 1s concave 1n 


for any X and that the associated distortion function x 
< hı and hz = 1 must be fulfilled, as occurs in the 


[0, 1]. In this case, = 
example shown in Figure 3.1 (right-hand side). 


Figure 3.1 Examples of GlueVaR distortion functions. 
Left. Distortion function is concave in [0, 1 — œ) and VaRq (X) < GluevaR?)" (X)< 
TVaR q (X) for a random variable X; 


Right. Distortion function is concave in the whole range [0, 1] and TVaRg (X) < 
we (X) < TVaRg (X) for a random variable X. 
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— -- Distortion function for TVaR, risk measur 
= -= -~ Distortion function for VaR, risk measure 


A : ae: hh, 
Distortion function for GlueVaR; a risk measure 


Distortion function for TVaR, risk measure 


3.4 Linear combination of risk measures 


Given a random variable X and for fixed tolerance levels a and f so that a < 
B, GlueVaR ji” (X) can be expressed as a linear combination of TVaRg(X), 
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TVaRq(X) and VaRq(X). This result allows us to translate the graphical- 
based construction of GlueVaR risk measures into an algebraic construction 
based on standard risk measures. 

If the following notation is used, 


_ (hg=hy)- (= B) 


Wy h 
h — 4 
pa e Msn (3:2) 
p-a 


w3 = l- wı- w = l- hs, 


then the distortion function x beh he Cu) in (3.1) may be rewritten as (details 
can be found in Section A.1 of the Appendix): 


Kgy? (U) = 01 -Yp(U) + 2° Ya (U) + 03° Walw) (3.3) 


where Yg, Ya» Wa are the distortion functions of TVaR at confidence lev- 
els p and q and of VaR at confidence level q, respectively (see Table 3.2). 
Therefore GlueVaR is a risk measure that can be expressed as a linear com- 
bination of three risk measures: TVaR at confidence levels 6 and a and VaR 
at confidence level a, 


GlueVaR g? (X) = wı-TVaRg(X)+w2:TVaRg(X)+w3:VaRa (X). (3.4) 


Given this relationship, an alternative notation for GlueVaR g’ A (X) and its 
related distortion function can be used. The notation GlueVaR®: (X) or 
Kea (u) may, on occasions, be preferred to that based on heights hı and 
hz. The bijective relationship between pairs (h1, h2) and (w1, w2) is also 
shown in Section A.2 of the Appendix. 

Specifically, in order to simplify the statement of Proposition 4.1, the expres- 
sion of Kga 2 (u) is 


wW w 
T i 2? fu ifO<u<l-f 
@1,02 w 
K b,a (u) w+ 2 cü if 1-Bsu<l-a (35) 
-a 
1 ifl-asus<l 


An interesting interpretation of (3.4) in the context of decision making and 
risk management is that GlueVaR risk measures arise as a linear combina- 
tion of three possible scenarios. So, two levels of severity tolerance can be 
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fixed, namely a@ and p, with a < p. Then, the risk can be measured in the 
highly conservative scenario with TVaR at level ĝ; in the conservative sce- 
nario with TVaR at level a; and in the less conservative scenario with VaR at 
level a. 

Each combination of these risk scenarios reflects a concrete risk attitude. 
Therefore, it can be said that the combination of these risk scenarios in this 
context is something that is directly identified by an explicit GlueVaR risk 
measure. To some extent, these risk attitudes could be related to risk ap- 
petite [ Aven, 2013]. 

From the practitioner's point of view, four parameters must be fixed in or- 
der to define the GlueVaR risk measure. The @ and p values correspond to 
the confidence levels used for bad and very bad scenarios, respectively. For 
instance, œ = 95% and f = 99.5% could be selected, which are equivalent 
to one bad event every twenty years or one bad event every two hundred, 
respectively. The other two parameters are directly related to the weights 
given to these scenarios. For instance, it could be said that the three com- 
ponents of GlueVaR in expression (3.4) are equally important. This would 
imply wı = w2 = w3 = 1/3, so the corresponding hı and hz parameters 
could be found. When w1 = w2 = w3 = 1/3 and a = 95%, p = 99.5%, these 
parameters are hy = 11/30 and h2 = 2/3. 


3.5  Subadditivity 


In a seminal article [Artzner et al., 1999] the following set of axioms that a 
risk measure should satisfy was established: positive homogeneity, trans- 
lation invariance, monotonicity and subadditivity. Authors referred to such 
risk measures as coherent risk measures. Distortion risk measures always sat- 
isfy the first three properties, but subadditivity is only guaranteed when the 
distortion function is concave [Denneberg, 1994; Wang and Dhaene, 1998; 
Wirch and Hardy, 2002]. Therefore, VaR, unlike TVaR, is not coherent. In 
some situations, coherence of risk measures is a requirement [Cox, 2012] 
but, nonetheless, some criticisms can be found [Dhaene et al., 2008]. Addi- 
tional properties for distortion risk measures are provided in [Jiang, 2008; 
Balbas et al., 2009], which may complement the list of properties for risk 
measures shown in Tables 1.2 and 1.4 of Chapter 1. In this section we focus 
on the subadditivity property. 

In order to preserve the benefits of diversification when aggregating risks, 
subadditivity is an appealing property of a risk measure. As it has been 
shown in Chapter 1, the subadditivity property ensures that the risk mea- 
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sure value of the aggregated risk is lower than or equal to the sum of indi- 
vidual risk measure values. For distortion risk measures, subadditivity may 
be defined as follows. 


Definition 3.1. Given a confidence level a € [0, 1], a distortion risk measure 
Pg is subadditive if, for any pair X, Y, 


fx- VdlgoP)< | Xa(gor)+ | Yaigo), 


where the integral symbol stands for Choquet Integrals with respect to the 
set function go P. 


The Choquet integral condition used in the definition can be rewritten, in 
terms of survival functions and Lebesgue integrals, as 


+00 


+00 +00 
[ g(Sxivades | g (Sx dx+ | g(Sy(y)) dy. 


As shown, GlueVaR risk measures may be interpreted as a linear combina- 
tion of VaR and TVaR risk measures. Therefore, a GlueVaR risk measure is 
coherent when the weight assigned to VaR is zero and the weights of the 
TVaRq and TVaRg are non-negative. In terms of the parameters of the dis- 
tortion function, GlueVaR is subadditive (and thus coherent) if hz = 1 and 


I=L 


< hı. More generally, any property satisfied by TVaR but not by VaR 
will be inherited by GlueVaR if wı > 0 and w3 = 0 in expression (3.2). 
Subaddtitivity in the whole domain is a strong condition. When dealing 
with fat tail losses (i.e. low-frequency and large-loss events), risk managers 
are especially interested in the tail region. Fat right-tails have been exten- 
sively studied in insurance and finance [ Wang, 1998; Embrechts et al., 2009a,b; 
Degen et al., 2010; Nam et al., 2011; Chen et al., 2012] and the behavior of ag- 
gregate risks in the tail region has received huge attention by researchers in 
last years [Cheung, 2009; Song and Yan, 2009; Hua and Joe, 2012]. To the 
best of our knowledge, however, previous studies of the subadditivity of risk 
measures in the tail region are scarce [Danielsson et al., 2005; Hua and Joe, 
2012]. 


3.6 Concavity of the distortion function 


The subadditivity characteristic in the whole domain is in general not sat- 
isfied by GlueVaR risk measures. It was showed that GlueVaR risk measures 
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can be interpreted as a linear combination ofa highly conservative scenario, 
a conservative scenario and a less conservative scenario. We argued that a 
particular risk attitude is reflected depending on how these scenarios are 
weighted. 

Given q and P, the other two parameters are directly related to the weights 
given to these scenarios. The shaded areas in Figure 3.2 delimit feasible 
weights (Ww 1,W2) for GlueVaR gg”. For instance, it could be said that the 
three components of GlueVaR i in expression (3.4) are equally important, that 
is, 1 = w2 = w3 = 1/3. The point (1/3, 1/3) in Figure 3.2 corresponds to the 
balanced risk attitude on the part of risk managers when faced by the three 
components shown in (3.4). The corresponding distortion function x re 
is concave on [0,1 — æ) in the lightly shaded area. Yet, the distortion func- 
tion is not concave on [0,1 — æ) in the darkly shaded area. The distortion 
function is concave in [0,1] in the boldest continuous segment and, thus, 
the associated GlueVaR risk measure is subadditive. 

Note that any pair of weights (w1, W2) on the boldest line in Figure 3.2 leads 
to w3 = 0. This means that a zero weight is allocated to the least conserva- 
tive scenario, i.e. the one associated with the VaRg(X). This is indicative 
of the decision makers’ conservative approach. Nonetheless, differences in 
just how restrictive this conservative attitude is can be found among the 
pes, 5], the less 
p-a p-a 


restrictive it is, while the nearer to (w1,w2) = (1,0), the more conservative 


weights lying on this line: the nearer to (w,W2) = | 


it is. 
If w < 0, risk managers are optimistic regarding the impossibility of the 
occurrence of the worst case scenario, and so attach a negative weight to it. 


3.7 Example of risk measurement with GlueVaR 


Data for the cost of claims involving three type of damages described in 
the previous chapter are used to illustrate the application of GlueVaR risk 
measures in risk measurement. The sample consists of n = 350 observa- 
tions of the cost of individual claims in thousands of euros. In Table 3.3 a 
set of quantile-based risk measures including three different GlueVaR are 
displayed. The table displays the corresponding risk figures for the cost of 
claims for property damage (Xj), the cost of claims of bodily injuries (X2), 
the cost of claims of medical expenses (X3) and the aggregate cost of claims 
(Xy +X + X3). 

The selection of the three GlueVaR risk measures included in Table 3.3 de- 
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Figure 3.2. Given a@ and f, the shaded areas delimits feasible weights (w1, w2) for 
GlueVaR iy 


O, 


f= l Heh 
p-a p-a N 


a (0,1) |TVaR, 


TVaR, 


VaR, (0.0) [E io) (1.0) 


serves further explanation. The two confidence levels considered are a = 
95% and p = 99.5%. The heights (hı, h2) are (11/30, 2/3), (0, 1) and (1/20, 
1/8) respectively. Different attitudes in front of the three scenarios of risk as- 
sessment are represented. GlueVaRgg so/'o5% corresponds to a balanced at- 
titude because the three quantile-based risk measures TVaRgg,5%, TVaRg5% 
and VaRgsy are equally important, i.e. Ww] = w2 = w3 = 1/3. A different atti- 
tude is symbolized by GlueVaRQ} 5%,95% 
w2 = 10/9 and w3 = 0. It corresponds to a scenario in which the manager 
overweights TVaRg5% and allocates the lowest feasible weight to TVaRg9.5% 
given that a zero weight is allocated to VaRg5%. Finally, GlueVaRjy.s), a: 
reflects a risk measurement attitude just a bit more conservative than the 


with associated weights w; = —1/9, 


one represented by using VaRg5%, assigning low weights to TVaRg9.5% and 


TVaRg5% (wi = 1/24 and w2 = 1/12). 


11/30,2/3 

99.5%,95% 
other two selected GlueVaR risk measures. This result can be generalized to 
all situations because the associated distortion function of GlueVaR}} oy or, 
is greater than the other two distortion functions in the whole domain. Note 


that it is also observed in Table 3.3 that 


As it is shown in Table 3.3, GlueVaR, is more conservative than the 


1/20,1/8 0,1 
GlueVaR gg 59, 95% S GlueVaRgg 50,95% 


It is only valid to these data and an ordering between them can not be gen- 
eralized. However, a relationship between these two GlueVaR risk measures 
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Table 3.3 Quantile-based risk measures and subadditivity 


Xı Xə X X,+X2+X3 Difference? 


(a) (b) (c) (d)  (a+b+c-d) 
VaRo5% 25 06 Ll 5.9 -1.7 
TVaR95% 12.5 80 13 19.7 2.1 
TVaR99.5% 40.8 42.1 18 81.1 3.6 
GlueVaRgo szosy 18.6 16.9 1.4 35.6 1.9 
GlueVaRjgavosm 49 29 Ld 10.2 -1.3 
GlueVaR93.5q,95% 94 4.2 12 12.9 1.9 


© Benefit of diversification. 


and quantile-based risk measures can be established. It has been shown that 


ie 
VaRa < GlueVaRg < TVaR« if hı < = 


That means, 


0,1 
VaRg5% < GlueVaRo3 5% 95% < TVaRg5%, because 0 < 0.1, and 


1/20,1/8 
VaRg5% < GlueVaRgo 5%,95% < TVaRg5%, because 0.05 < 0.1. 


Although results in Table 3.3 may suggest that 


11/30,2/3 
TVaRg5% < GlueVaR oo s% 95% Š TVaRog9.5%, 


this can not be asserted in general because conditions on the parameters of 
the GlueVaR risk measure to satisfy 


TVaR« < GlueVaR 5” <TVaRg 


are f < hı and hz = 1. 
l-a 


In this case it holds that 0.1 < 0.37 but h2 #1. 
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Let us analyze the subadditivity property. Like the VaRg5%, note that the 
GlueVaR}q ave pos fails to be subadditive for X1, X2 and X3 since 4.9+2.9+ 
1.1 < 10.2. Let us emphasize that these three GlueVaR risk measures have 
not associated a concave distortion function in the whole domain, so the 
subadditivity property can not be guaranteed for any of them. The fact that 
f i 11/30,2/3 0,1 
risk values are subadditive for the GlueVaRgg zo g5 and GlueVaRgy 5%,95% 
is a characteristic attributable to this data but cannot be generalized to all 
11/30,2/3 1/20,1/8 
contexts. In the case of the GlueVaRgg s% 05% and GlueVaR oo 5% 95%! the 
associated distortion function is concave in [0, 1 — æ). 


3.8 Exercises 
1. Consider the following empirical distribution 
13, 15, 26, 26, 26, 37, 37, 100 


11/30,2/3 0,1 
85%,50% and GlueVaR ga, 50% 
Hint: The heights (hı, h2) equal to (11/30, 2/3) and (0, 1) correspond to 


(w = 1/3, w2 = 1/3) and (w1 = —1/9, w2 = 10/9), respectively. 


Determine the GlueVaR 


2. Consider the following distribution function F(x) = forO<x <3. 


; 1/20,1/8 
Find the GlueVaRg9 505, 9595" 


Hint: The heights (hı, h2) equal to (1/20, 1/8) correspond to wı = 1/24 
and w2 = 1/12. 


3. (Exemple 6.7 in [McNeil et al., 2005]) Consider a defaultable corporate 
bond. The default probability is equal to 2%. The current price of the 
bond is 100. If there is no default, a bond pays in t+ 1 an amount of 105, 
otherwise there is no payment. Hence L the loss of bond is equal to 100 
when the bond defaults and to —5 otherwise. Compute the VaRg5% of 
the following two portfolios: 


- Portfolio A consists of 100 units of this bond. 


* Portfolio B consists of 100 independent bonds with the same charac- 
teristics that this bond. 


Hint: The loss function of a bond can be expressed as L = 100 Y - 5(1 — 
Y) = 105Y —5, where Y is a indicator variable that takes value 1 if the 
bond defaults and 0 otherwise. 
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4. Let X and Y betwo independent random variables uniformly distributed 
between [0,1] and Z = X + Y their sum. Analyze if it holds that VaR25% 
(Z) < VaR25% (X1) + VaR25% (X2). 


Hint: The cumulative distribution function of Z is 


z2 
— if0<z<l 
Fz(z) = 2 2 


Z 
ae ifl<z<2 


4  GlueVaR and other new risk 
measures 


This chapter is structured in two parts. Analytical closed-form expressions 
of GlueVaR risk measures for commonly used statistical distributions in the 
insurance context are derived. These closed-form expressions should en- 
able practitioners to undertake an effortless transition from the use of VaR 
and TVaR to GlueVaR. Third order Cornish-Fisher approximations to Glue- 
VaR risk measures for general skewed distribution functions are also intro- 
duced in this chapter. Finally, the relationship between GlueVaR, Tail Dis- 
tortion risk measures and RVaR risk measures are shown. 


4.1 Analytical closed-form expressions of GlueVaR 


A useful consequence of (3.4) is that when analytical closed-form expres- 
sions of VaRg(X) and TVaRqg(X) are known for a random variable X, the 


closed-form expression of GlueVaR%” 


? (X) can automatically be derived 
without further complications. Otherwise, using the definition of GlueVaR 
as a distortion risk measure, the Choquet Integral of X with respect to the 


set function le o P should be calculated. 


4.1.1 Illustration: GlueVaR expression for Student t distribution 


Let X be arandom variable such that X = H is distributed as a Student 


t random variable with v degrees of freedom (df). In such a case, X has u 


laene 
Vo 
mean and a standard deviation equal to 5" Then 
vV — 


VaRq(X) = U+0: ta 
T(ta) (= ʻa) 


l-a \v-l 


TVaRq(X) =p+o: 
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where fq is the a-quantile of a Student t distribution with v df and T is its 
density function. 
Using (3.4) the GlueVaR of random variable X is 


2 
hish; T(t) [Y+ tp 
GlueVaR j, (X) = w1: Eto: 1-6 {= 
a v+ t2 
+W2:|U+o: = (= +(1-—@)—W2):(uUt+o: ta) 
2 
_ hy In-h wre 
=Uto: 5- p-a jo 
ho —h t2 
E Erta (2 2) + (1 ha) ta]. 
f- v-1 


4.1.2 Analytical expressions for other frequently used 
distributions 


Normal (^), Lognormal (£M) and Generalized Pareto (42) distribu- 
tions have simple closed-form expressions of GlueVaR. The same notation 
conventions that were introduced in Chapter 1 are used. Namely, ¢ and 
® stand for the standard Normal pdf and cdf, respectively. The standard 
Normal distribution œ and 6 quantiles are denoted as qa = ®~!(a) and 
dp = ® !(6). For the YF distribution, the definition provided in Hosking 
and Wallis [1987] is considered, where the scale parameter is denoted by o 
and k is the shape parameter. The 4# distribution contains the Uniform 
(k = 1), the Exponential (k = 0), the Pareto (k < 0) and the type II Pareto 
(k > 0) distributions as special cases. Closed-form expressions of GlueVaR 
for several distributions are presented in Table 4.1. Note that there are some 
exceptions to the general rule to deduce these closed-form expressions to 
be considered. When X follows a Pareto distribution with k < 1 and for 
any confidence level a, TVaRg (X) = as as we have shown in Table 1.3 of 
Chapter 1. But when hı = 0 GlueVaR”! Ba ue (X) is finite. There is a compensa- 
tion effect between TVaRg (X) and TVaRg(X). This is taken into account in 
Table 4.1. 
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Closed-form expressions of GlueVaR for some selected distributions 


Table 4.1 
eee GlueVaR 5, ahh expression 
he hy hy 
Normal: [+0-qa:(l1-ho)+a- Boa (4a) — P(4p)) + 0° 5 
2 
Ae) (ap) 
exXP(H +0: qa): (1— h2) 
: o°) hy-hy 
Lognormal: +exp|u+ å á [O(a qa) O(a qp)l 
LN (u, 0°) 3 k 
o 1 
+exp HE ep “D(a — qp) 
Exponen- i ae p 
tial: o-[h2-ln(1-a)]+0:(1 p-m] £). oes : | 
GP(k, 0), l-a p-a 1-6 


with k=0 
Continued on next page 
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Table 4.1: continued from previous page 


nei GlueVaR p,a”vhz expression 
a if k<—1, h, 40 
pe! 
27m g k k 
ta AA el p)k--a)*] 
hy — hı Oo k 
5 f +1 1 k+1 
-a k+1 [i= a- py" 
if k<-1, h =0 
| 
c-|— -1 
Pareto: ae i r 
GP(k,o), SAND iB | - 
withk<0 p-a 1-6 l-a 
hy—h 1- 
em in(—) if k=-1, h =0 
-4q 1-6 
= [1-(l-a)*] 
o (ho-h h 
+ | mile 5} 1a a)*.(1—B)] 
k \p-a 1-8 
ho-h, o | k-Q-a)*! 
p-a k k+1 
ho—h — pyktl 
| DE |Z. E ke(-1,0) 
B-a 1-6) k | k+l 
Iy-h, h 
E 6 er fem | cl Ja a)*-(1-B)] 
k k \ b-a 1-8 
Type II 
Pareto: 2 h o k-(1—a@)**! 
GP(k,o), b-a k k+l 
with k>0 (eae he o a-p)! 
b-a 1-6) k | k+l 


4.1.3 The Cornish-Fisher approximation of GlueVaR 


General considerations about Cornish-Fisher VaR methodologies have been 
pointed out in Section 1.2.2 of Chapter 1. Approximations to GlueVaR risk 
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measures for general skewed distribution functions using a Cornish-Fisher 
expansion of their quantiles are provided in this section. In insurance ap- 
plications managers often have to face to highly skewed random variables 
with right fat tails. In many of these situations, however, they do not know 
whether the underlying random variable of interest is distributed according 
to a known parametric distribution function. In those situations that the 
distribution is unknown, the value of the common quantile-based risk mea- 
sures is routinely approximated by practitioners. It is shown that approxi- 
mations of GlueVaR risk measures for general unknown skewed distribution 
functions can be directly obtained by means of the relationship of GlueVaR 
risk measures and the standard quantile-based risk measures. 


The Cornish-Fisher expansion is widely used by practitioners to approxi- 
mate the VaRg (X) and TVaRg (X) values when the random variable follows 
a skewed unknown distribution [see Cornish and Fisher, 1937; Fisher and 
Cornish, 1960; Johnson and Kotz, 1970; McCune and Gray, 1982]. The VaR 
and TVaR measure values can be approximated as VaRa (X) = HU + quad 
and TVaRq(X) = M+ Gtv,ad, where u = E[X], a? = V[X] and both qua 
and qtv,a are modified quantiles of the standard Normal distribution that 
take into account the skewness of the distribution function of X. 


Following Sandström [2007], the modified quantiles gy,q and qtv,a are com- 
puted as follows. Let us consider y = E[(X — u)]/o° as a measure of the 
skewness of the random variable X. If qa = ®' (a) and ġ are the a-quantile 
and the density function of the standard Normal distribution, respectively, 
then qy,q and Gtv,q can be written as, 


dua = 2 (a) +} AG a)’ -1] = da+t (qe), 


D-t (a)) 
HOH) Yorke 


dtva = i ome = [1+ +Xq J 


Extensions of the Cornish-Fisher expansion that consider moments of higher 
order than y have been provided in the literature [see, for instance, Gi- 
amouridis, 2006]. More details can be found in Appendix B of Sandström 
[2011]. 


Given the interpretation of a GlueVaR risk measure as a linear combination 
of risk measures which was shown in (3.4), the approximation for the Glue- 
VaR of the random variable X following the Cornish-Fisher expansion can 
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be obtained as 
hy,h 
GlueVaR Ba *(X) =p+o 
| h2- hy 
+ 
p-a 
The error of the approximation is upper bounded by the maximum error 


incurred when approximating VaR« (X), TVaRq (X) and TVaRg(X) using the 
equivalent Cornish-Fisher expansion for skewed distributions. This result is 


es 
1-6 p-a 
| (qa) (1+ Za) +0- ( (a2 -1) + 4) 


janli ta 


straightforward. It follows from the linear relationship shown in expression 
(3.4) and taking into account that weights w1, w2 and w3 are lower or equal 
than one, satisfying that wı + w2 + w3 = 1. 


4.2 Onthe relationship between GlueVaR and Tail 
Distortion risk measures 


As it has been aforementioned, different works that pay attention to risk 
measures based on distortion functions or based on several generalizations 
of quantiles have appeared in recent years. See, for instance, Zhu and Li 
[2012]; Bellini and Gianin [2012]; Bellini et al. [2014]; Dhaene et al. [20124] 
and Goovaerts et al. [2012]. 

This section is devoted to reveal the connections between GlueVaR risk mea- 
sures and Tail Distortion risk measures. To the best of our knowledge, Tail 
Distortion risk measures were introduced in Zhu and Li [2012]. Here the no- 
tation used for these family of risk measures is adapted from that in Lv et al. 
[2013]. Consider a distortion function g, this is anon-decreasing and injec- 
tive function g from [0,1] to [0,1] such that g(0) = 0 and g(1) = 1, anda 
confidence level a € (0, 1). The Tail Distortion Risk Measure T g q associated 
to g and q is defined as the distortion risk measure with distortion function 
Za, where 


gaw =g- a )-10< u<l-a]+1[l-a<su<l]. 


-a 
Note that 1[0 < u < 1-— qa] is a function that takes value 1 in the interval 
[0 < u < 1-—a] and 0 elsewhere. In other words, if X is a random variable 
representing a loss in a probability space (Q, £, P) and its survival function 
is Sx(x) = P(X > x), then 


+00 


0 
TyatX)= | [ga(Sx(x))- 1Jax+ | Sa (Sx(x)) dx. (4.1) 
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Note that gq is continuous in 1 — @ or, alternatively, ga (1 — æ) = 1. 


W),W2 
ed 

, w and w2. This GlueVaR is equivalent to a Tail Distortion risk measure To q 
q g 


if, and only if w2 = 1 — w; and 


Proposition 4.1. Consider a GlueVaR risk measure with parameters Q, 


wi =a) 
1-6 
+w t l-0) H- LA-A- pts]. (4.2) 


g(t) = +1-@]|-t-1[0<t<(-a)!-(1-p)] 


The proof is provided in Section A.3 of the Appendix. 

Note that only GlueVaR risk measures with w3 = 0 can be represented as 
Tail Distortion risk measures, because w + W2 +W3 = 1 must hold as part of 
the definition ofa GlueVaRg risk measure. In other words, one can only 
represent as Tail Distortion risk measures those GlueVaR that do not weight 
the part corresponding to the VaRq. 

The origin of GlueVaR risk measures can be found in Belles-Sampera [2011]. 
As a curiosity, the definition of a parametric family of risk measures named 
PUp-TVaR can also be found therein, which are exactly the Tail Distortion 
risk measures linked to Proportional Hazards Distortion functions g(u) = 
ua, a> 1 from the perspective of Zhu and Li [2012]. 


4.3 Onthe relationship between GlueVaR and RVaR 
risk measures 


To the best of our knowledge RVaR risk measures were introduced in Cont 
et al. [2010]. This section is dedicated to show a close relationship between 
GlueVaR risk measures and the RVaR family. We have recently discovered 
this connection. It is highly probable that some synergies between the re- 
search associated to both families are going to arise. For instance, existing 
results related to capital allocation principles using RVaR [see, for instance, 
Embrechts et al., 2016] and the ones obtained with GlueVaR (which we are 
going to present in Chapter 7) may be interconnected. Another example of 
these synergies may be found in the analysis of risk attitudes that we present 
in Chapter 5 because, as we will show, the application of our assessment 
tools are straightforward in RVaR risk measures. 

Let X be an absolutely continuous random variable, which positive values 
represent losses. Let a, p € [0,1]. The value of the risk measure RVaRa,p 


58 RISK QUANTIFICATION AND ALLOCATION METHODS FOR PRACTITIONERS 


applied to X is, by definition, the following: 
1 a+b 
— VaR, (X)dy if B>0 
RVaRa g(X)=4 B Í ayay p (4.3) 
VaRa(X) if B=0 


The notation used for VaR is not the one used when defining GlueVaR risk 
measures, because VaRg (X) in (4.3) refers to the 100(1—a)% quantile while 
in GlueVaR definition the notation VaRg(X) is used to representing the 
100a% quantile. In order to find relationships between RVaR and GlueVaR 
risk measures it is convenient to rewrite expression (4.3) as 


1 rats 
Watel ZB Í VaRı-y(X)dy if B>0 


(4.4) 
VaRı-a(X) if B=0 

The notation used in definition of GlueVaR risk measures for VaR is incor- 
porated in (4.4). 
Note that 

1 pats 1 rice 

al VaRı-y(X)dy = — VaR, (X)dA 

B Ja P Ji-(a+p) 


1 1 
a |f vak, 0042- Í VaR, 00d 
B | Ji-(a+p) l-a 


notation used in def, of _ 1 
GlueVaR = B [(a + B)TVaRy~(a+p)(X) - aTVaRj-«(X)| 


a+p a 
= ——TVaR\-(a+p)(X) —- GTVaRi-a(X) (4.5) 
p B 

Let us introduce some additional notation: a = 1 — q and b = 1 — $. So it 
can be deduced that 1 — (a+ B) = a+ b- 1 and that q + p = 2 — (a + b). 
Therefore, last expression in (4.5) may be rewritten as 

2- (a+b) l-a 

jap eed (X) — Top aka (X) (4.6) 
Note now that a+ b— 1 < a because a, b € [0,1], and this implies that 
TVaRa(X) > TVaRg+p_1(X). Additionally, if 


v _ l-a Jü _2-(a+b) 
1 Tp i= — y (4-7) 
1 l-b 
hi = 1 2 = =f]; 
then wı + w2 p" +2-a-b] ip 


GLUEVAR AND OTHER NEW RISK MEASURES 59 


Recall that a usual way to write the value of a GlueVaR risk measure applied 
to X is as follows: 
GlueVaR3' (X) = @]TVaRg(X) + @2TVaRa (X) 
+ (1-0 — @2)VaRa(X) (4.8) 


where @] and @> depend on parameters @, p, hı and hz. 
Taking into account the previous expressions and notations, consider 


@=a+b-1 

p=a 

Po l-a 

O =w= Ee (4.9) 
Ted _2-(a+b) 

2=02= ——} 


As long as @] and &3 are related to hy and ho by the next expression (as it 
is shown in Section A.2 of the Appendix) 


Bat eee 


it is deduced from (4.9) and (4.10) that 


T- l-a 2-(a+b) l-a aU- 3) 
+= T= l-b |2-(a+b) 1-b ~ a 
~ l-a [2-(a+b)] l-b a 
ho = = = |: 

l-b l-b l-b 


So, putting (4.8), (4.9) and (4.11) altogether the following expression holds: 


]— 
GlueVaRo p1 0O = -7 TVaRa (X) 
2- (a+b) 
+ yup M (4.12) 


The right-hand side of expression (4.12) is exactly expression (4.6). This 
means that, for p > 0 (or, equivalently, for b < 1), 


RVaRa,g = GlueVaR”! 


a,a+b-1" 


For p = 0 (or b = 1), expression (4.12) is not well defined, but a+ b—1 = ain 
such a case and it can be checked that GlueVaR? = VaRa. To get the feel- 
ing that this is correct, let us plot the distortion function of GlueVaR”, pe 
in Figure 4.1. 


1 
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Figure 4.1 Distortion function of GlueVaR”” distortion risk measure. 


1 


a 
GlueVaR 7 a+p-1 


Looking at the plot of the distortion function in Figure 4.1, if b = 1 then 
2—(a+b) = 1- a and then the distortion function of VaR would appear. 
In addition, if a = 1 (which means that æ = 0) then the distortion function 
of TVaRz, is obtained. As it is one of the purposes of RVaRq,g risk measures, 
it is possible to continuously moving from VaR to TVaRj_,. 

Taking all the previous comments into account, the following equivalence 
holds: 


RVaRa,p = GlueVaR p1 (4.13) 


4.4 Example 


The example of risk quantification shown in Chapter 3 is followed to es- 
timate GlueVaR risk measures considering alternative distributions. Out- 
comes are shown in Table 4.2. The table is divided into four blocks, each 
block representing the corresponding risk figures for the cost of claims for 
property damage (X1), the cost of claims of bodily injuries (X2), the cost 
of claims of medical expenses (X3) and the aggregate cost of claims (X + 
Xə + X3). Risk measure values using the empirical distribution (first row) 
are compared with outcomes when Normal, Lognormal, Student t with 4 
df and Generalized Pareto distributions are fitted to data. In the last two 
rows of each block results are shown when risk measure values are approx- 
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imated by a Cornish-Fisher expansion. The sample mean (fi = Z), the sam- 
ple deviation (G* = £; (Z; — Z)7/(n— 1)) and the sample skewness (calcu- 
lated as ¥ = 67’ (X; (Zi— Z) /n)) are considered as estimators of p, 0 and y 
when Z is one of the four random variables X1, X2, X3, X1 + X2 + X3. Sam- 
ple statistics were computed using observations that fall below the 99.5% 
quantile in order to exclude the effect of extreme losses on estimates (first 
Comnish-Fisher approximation). That means, a subsample of the first 348 in- 
creasingly ordered elements of the random variable were used to estimate 
parameters. Therefore, the two highest values were considered as extreme 
losses and were not included. Outcome values of risk measures were com- 
pared with the risk measure approximations when all the observations are 
included on sample estimates (second Cornish-Fisher approximation). All 
the calculations were made in R and MS Excel. 


Table 4.2 Examples of risk measurement of costs of insurance claims using 
quantile-based risk measures 


Glue VaR 933%, 95% 

Model VaRg5%, TVaRg5y, TVaRgg5x, (44,2) (0,0) (4,3) 

Xı 
Empirical 2.5 12.5 40.8 18.6 94 49 
Normal 6.9 8.5 11.6 90 8l 7.2 
Lognormal 2.7 8.7 32.7 145 54 44 
Student t (4 d.f.) 8.8 12.8 24.6 15.4 115 98 
Pareto 2.5 5.5 18.1 8.7 4l 34 
Cornish-Fisher"® 8.5 27.8 128.4 549 16.6 15.1 
Cornish-Fisher?” 16.3 59.1 284.0 119.8 34.1 31.0 

X2 
Empirical 0.6 8.0 42.1 16.9 42 29 
Normal 6.0 7.4 10.2 79 7.1 63 
Lognormal 1.6 5.8 25.7 11.0 36 30 
Student t (4 d.f.) 7.6 11.2 21.7 13.5 10.1 85 
Pareto 1.6 3.5 11.9 57 26 2.2 
Cornish-Fisher®® 2.2 2.7 3.7 29 26 23 
Cornish-Fisher?” 6.1 7.5 10.3 8.0 72 64 

X3 
Empirical 1.1 1.3 1.8 l4 12 1l 


Continued on next page 
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Table 4.2: continued from previous page 


GlueVaR 935%, 95% 
Model VaRg5%, TVaRgsx% TVaRəg5% (44,2) (0,1) (4,3) 
Normal 0.9 1.0 1.3 1l 10 09 
Lognormal 0.8 1.4 3.2 1.8 1.2 1.0 
Student t (4 d.f.) 1.0 1.4 2.5 17 13 1l 
Pareto 0.9 1.4 3.0 18 12 10 
Cornish-Fisher®® 1.0 2.1 7.6 36 15 14 
Cornish-Fisher®”) 1.1 2.0 6.1 31 15 14 
Xı + X2+ X3 
Empirical 5.9 19.7 81.1 35.6 12.9 10.2 
Normal 12.6 15.4 21.1 16.4 14.8 13.2 
Lognormal 5.5 15.8 60.7 27.3 108 86 
Student t (4 d.f.) 16.0 23.2 444 278 208 17.7 
Pareto 5.0 11.0 36.0 17.3 81 6.7 
Cornish-Fisher*® 11.0 34.3 155.5 66.9 20.8 19.0 
Cornish-Fisher*” 34.6 134.0 659.7 276.1 75.6 68.9 


aa 7G 


f=0.5,6= 


2.3 and f = 6.4. Subsample without extreme losses. 


The two largest values of Xj are excluded. 


Of) t= 0.7,0= 
Ca ü =0.2,6 = 


3.8 and ¥ = 8.7. Full sample. 


1.2 and ¥ = 0. Subsample without extreme losses. 


The two largest values of Xz are excluded. 


Cb i= 0.5, = 
6a ñ =0.3, 6 = 


3.4 and f = 0. Full sample. 


0.3 and F = 2.6. Subsample without extreme losses. 


The two largest values of Xz are excluded. 


6b f= 0.3, = 
6a i= 1,0, 0= 


0.4 and f = 1.4. Full sample. 
3 and f = 5.9. Subsample without extreme losses. 


The two largest values of X1 + X2 + X3 are excluded. 


(4b) 5 


f=14,¢6= 


6.8 and f = 11.4. Full sample. 


Some comments related to outcome values for Cornish-Fisher approxima- 
tions of the quantile-based risk measures should be made. According to the 
results, one could think that this kind of risk measurement corresponds to 
a conservative attitude for the two types of approximations shown in Ta- 
ble 4.2. The exception would be risk values obtained when X3 is evaluated. 
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Relevant differences are observed depending on the approximation finally 
used on right skewed data. Outcome values related to the second Cornish- 
Fisher approximation (full sample) are drastically large when the data are 
severely right skewed distributed, as in the case of Xı and Xı + X2 + X3. 
These outcome values would be associated to a excessively conservative (un- 
realistic) attitude. If the first Cornish-Fisher approximation is considered, 
i.e. when sample statistics were estimated excluding extreme losses, a sig- 
nificant drop is observed although the outcome values for this approxima- 
tion are still larger than those values associated with the empirical or the 
parametric distributions for right-skewed random variables. Note that only 
the two largest losses are not included in the sample estimates involving the 
first approximation. When the data are slightly right skewed distributed, 
as in the case of X3, the two Cornish-Fisher approximations show a bet- 
ter performance. In other words, the Cornish-Fisher approximation should 
be used with certain caution when the data are severely right skewed dis- 
tributed. Probably higher order moments should be taken into account. 
An important issue that arises from these results is the model risk. Even 
when the same risk measure is used, huge differences are observed depend- 
ing on the hypothesis about the underlying distribution of the claim cost 
random variables. Let us assume that the regulator is focused on the VaRg5% 
for the aggregate cost X; + X2 + X3 as a measure of pure underwriting risk 
(without taking into account the premium paid by the policyholders). Ifit is 
supposed that the random variable is Pareto distributed, then the institution 
will need 5 thousands of euros for regulatory solvency purposes. The com- 
pany should set aside almost 3.2 times this economic amount whether the 
underlying distribution is Student t with 4 degrees of freedom. This topic 
is out of the scope of this chapter. The interested reader is addressed, for 
instance, to the study of Alexander and Sarabia [2012] which deals with VaR 
model risk or to the reference Aggarwal et al. [2016] suggested at the end of 
Chapter 1. 


4.5 Exercises 


; : 11/30,2/3 
1. Determine if the GlueVaR oq 5% 95% O 


able X with M" (u = 5,07 = 16) satisfies the expressions related to the 
properties of Translation invariance, Positive homogeneity and Strictness. 


f a Normal distributed random vari- 


2. Consider the Normal distributed random variable Xı with VW (u = 5, o? = 
16) and the Normal distributed random variable X> with WV (u = 4,07 = 
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20). Show that: 


© VaRq(Z) is equal to VaRg(X1) + VaRa (X2) for any a in the case that 
rxy = 1, where Z = X + X2 and rxy is the Pearson linear correlation 
coefficient. 


* When rxy < 1, then VaRq(Z) < VaRq(X1) + VaRa (X2). 


- Repeat the analysis for the GlueVaR g7] Aa risk measure. 


3. Check that if X is distributed as a Pareto (X ~ G@P(k,a), with k < 0), 
the expression for TVaRg (X) shown in Table 1.3 may be obtained either: 


1 1 
* From Definition 1.9 (i.e, TVaRg (X) = —| VaR, (X) dA), or 
-a Ja 


* From the expression of TVaRg (X) as a distortion risk measure shown 
in Section 3.2 (i.e. 


0 +00 
TVaRa = | [ya (Sx -1]dx+ | ya (Sx ae, 
—00 
where Ya is the distortion function displayed in Table 3.2). 
Hint: Note that, in this case, 


k Wk 
l = E VaR 00) = Sx(VaRa(X))=1- Q. 
o 


0,1 


4. Obtain the RVaR risk measure equivalent to the GlueVaRg5o 90%" 


5 Risk measure choice 


Tools designed to provide adequate risk measurements are needed by both 
decision making agents and regulatory agents, who require information about 
potential losses within a probabilistic framework. As such, the choice of a 
risk measure plays a central role in decision making in many areas includ- 
ing health, safety, environmental, adversarial and catastrophic risks [Cox Jr., 
L.A., 2013; MacKenzie, 2014]. Many different risk measures are available to 
practitioners, but the selection of the most suitable risk measure to be used 
in a given context is generally controversial. A key element in characteriz- 
ing a risk measure is the underlying risk attitude that is assumed when this 
measure is used for risk assessment. Therefore, in selecting the best mea- 
sure, the practitioner is concerned with how a particular measure matches 
up with the alternatives. However, this simple question only has a complex 
answer. 


Consider the Value at Risk and the Tail Value at Risk, probably the most com- 
mon risk measures used in assessing risk. Suppose @ is the confidence level, 
which reflects the degree of tolerance to undesirable events. The VaRg (X) 
is the a-quantile of loss X, while the TVaRg (X) averages quantiles ranging 
from the a-quantile to the maximum (the 100%-quantile) of X. Based on 
these definitions, it seems obvious that these two quantile-based risk mea- 
sures can be directly compared in terms of their respective conceptions of 
risk using their associated confidence levels. For instance, the VaR provides 
for a concept of risk associated with a barrier, beyond which the decision 
maker assumes that catastrophe lies [Alexander and Sarabia, 2012]. A VaR 
at a 95% confidence level presents a lower resistance to undesirable events 
than a VaR at a 99% level. This also holds for TVaRg (X). Comparisons of 
VaR and TVaR measures can likewise be readily undertaken when their re- 
spective confidence levels are fixed and equal. Given an -confidence level, 
the TVaRq (X) is always greater or equal than the VaR (X). However, a di- 
rect comparison cannot be made if the VaR and the TVaR risk measures 
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have different confidence levels. For example, imagine a decision maker 
wishes to compare the implicit risk attitude of the TVaRg5%(X) and the 
VaRgg9x%(X). In this instance, it is not immediately obvious which of these 
two risk measures offers the greatest risk tolerance. Furthermore, if the de- 
cision maker wants to know the risk attitude of a measure other than that of 
these two quantile-based risk measures, comparisons are even less intuitive. 
Here, we focus on the family of distortion risk measures, where the VaR and 
TVaR can be understood as two particular cases. This chapter seeks to con- 
tribute to the study of attitudes towards risk in the assessment of risk. The 
study analyses the risk perception that is implicit when an agent applies a 
particular distortion risk measure. A battery of instruments is developed 
to facilitate the comparison of the risk attitude of distortion risk measures 
from both global and local perspectives. The results afford new elements for 
determining the suitability of a particular distortion risk measure in com- 
parison with other available options. They also allow an agent to determine 
which risk measure provides the most risk tolerant behavior. 

An illustrative example of the risk attitude characterization implicit in a dis- 
tortion risk measure is included in this chapter. The European insurance 
regulatory framework serves as an excellent example of the choice of a com- 
pulsory risk measure, i.e. VaRg9.5%. However, insurers implement other 
choices in their internal tools. We show that, given a particular insurer’s 
dataset, distortion risk measures other than that of the Value at Risk can pro- 
vide the same risk estimates. However, if the insurer does choose a different 
risk measure, this chapter provides complementary tools for evaluating risk 
that can be used to understand its position in the European insurance or 
financial market, or even to benchmark it in relation to the mandatory risk 
assessment standard. 


51 Aggregate attitude towards risk 


The characterization of the implicit attitude towards risk in a given distor- 
tion risk measure is carried out by means of the analysis of the distortion 
function, which provides a precise portrait of the underlying risk position 
of a decision maker when selecting a particular risk measure for risk assess- 
ment. 

Let us consider the mathematical expectation. As indicated previously in 
Section 3.2 of Chapter 3, the mathematical expectation can be understood 
as a distortion risk measure involving the identity function id as the associ- 
ated distortion function. Figure 5.1 illustrates the distortion function associ- 
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ated with the mathematical expectation. In other words, the mathematical 
expectation can be understood as the distortion risk measure used by agents 
when survival probabilities are not distorted (i.e. they are distorted by the 
identity function). So, the mathematical expectation can be associated with 
a risk neutral attitude of the agent. 


Figure 5.1 Distortion function of the mathematical expectation 


The area under the distortion function can be understood as an indicator of 
the aggregate risk attitude of an agent, with decision makers being classified 
as risk tolerant, risk neutral or risk intolerant. Note that we assume that a 
risk neutral agent would not distort the survival distribution function, so the 
associated distortion function linked to aggregated risk neutrality would be 
the id function. The area under the id function is one half and this value 
could be used as a benchmark of global risk attitude. A globally risk intol- 
erant agent would make an upper distortion of the survival distribution in 
accumulated terms; thus, the area under g for this agent would be larger 
than one half. Similarly, an agent would be globally risk tolerant if the area 
under g was lower than one half. 

The distortion functions of the Value at Risk and Tail Value at Risk with alpha 
confidence level are shown in Figure 5.2 and Figure 5.3, respectively. From 
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Figure 5.2 it is straightforward to check that the area under the distortion 
function of the VaRgq is a. Similarly, from Figure 5.3 it is easy to observe that 
the area under the distortion function of the TVaRg is a + (1 — @)/2. 


Figure 5.2 Distortion function of the VaRq risk measure 


eee a 


4 è 


l-a 1 


An agent would be globally risk neutral using either the VaRg measure with 
a = 0.5 or the TVaR« with æ equal to zero. In terms of aggregate risk neu- 
trality it would be equivalent to use either the mathematical expectation, 
or VaRso% or TVaRo%. Note that VaRsoy is the median and TVaRox is the 
mathematical expectation. Similarly, an agent would be globally risk intol- 
erant using either the VaRg with æ > 0.5 or the TVaRg with a positive a. 
On the contrary, an agent would be globally risk tolerant using the VaRq 
with æ < 0.5. It is worthy to emphasize that under the definition of aggre- 
gate risk attitude followed in this chapter an aggregate risk tolerant agent 
would never use the TVaR risk measure, since the area under the distortion 
function is bounded in the interval [0.5, 1]. 

GlueVaR risk measures were introduced in Chapter 3 as a class of distortion 
risk measures. Recall that these measures are defined by means of four pa- 
b- 


rameters a, p, wı and w2,suchthat0 <a < <1, 


<wı <landwı+ 
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Figure 5.3 Distortion function of the TVaR g risk measure 


w2 < 1. This parametric flexibility of GlueVaR risk measures makes them 
useful in a variety of contexts, because each particular risk context could re- 
quire a different set of parameter values. In the case of the GlueVaRg y, 
the implicit global risk attitude would depend on the values of the four } pa- 
rameters that define the risk measure. 

To conclude, in Belles-Sampera et al. [2013b] we introduced some relation- 
ships between distortion risk measures and aggregation operators. Aggre- 
gation functions (or operators) are mathematical functions used for com- 
bining information in many fields of human knowledge, as artificial intelli- 
gence, biology or economics [Torra and Narukawa, 2007]. Aggregation in- 
dicators are used to characterize the aggregation function [Belles-Sampera 
et al., 2014c, 2013c]. One of the most frequently used indicators is the de- 
gree of orness, which seeks to summarize the importance of each ith-order 
statistic, i = 1,...,, inthe aggregation process associated with the Choquet 
integral with respect to a set function. This indicator provides some kind of 
level of preference inherent to such an aggregation function on a [0, 1] scale, 
where 0 represents the minimum and 1 the maximum order statistic (Duj- 
movic [2006]; Fernandez Salido and Murakami [2003]). In Belles-Sampera 


70 RISK QUANTIFICATION AND ALLOCATION METHODS FOR PRACTITIONERS 


et al. [2016c] we showed that the degree of orness can be interpreted as an 
approximation of the area under the distortion function. 


5.1.1 Local risk attitude 


It is reasonable to suppose that decision makers do not worry about all ran- 
dom event losses in the same way. Decision makers frequently treat differ- 
ent random events distinctly (note that some of these events can represent 
benefits or affordable losses). While the area under the distortion function 
evaluates the accumulated distortion performed over the survival distribu- 
tion function, it does not take into account which part of the survival distri- 
bution function was distorted. Clearly, from the perspective of a manager, 
distorting the survival probability in the right tail of the random variable 
linked to losses is not the same as distorting the probability in the left tail. 
Additionally, all distortion functions with an area equal to one half would be 
associated with global risk neutrality, where the id function is only a par- 
ticular case. 

In Figure 5.4 it is shown an example in which the size of the area under 
several distortion functions is the same. Obviously, these distortion func- 
tions have not associated the same risk attitude. In the case of the distor- 
tion function represented by a dotted line, survival probability values in the 
interval [0, 0.5] are overweighted and survival probability values in the inter- 
val [0.5, 1] are underweighted. So, relatively high losses are overrepresented 
(right tail) and relatively low losses are underrepresented. On the contrary, 
the distortion function represented by the solid line overweights relatively 
low losses and underweights high losses. Note that also the area under the 
diagonal, which is in fact the distortion function of the mathematical expec- 
tation, is the same. 

Therefore, the global vision of risk embedded in a risk measure has to be 
completed with local information. One option open to us is to define the risk 
attitude in absolute terms. An absolute risk neutral agent is a decision maker 
that does not distort the survival probability and who, therefore, uses the id 
function as the associated distortion function, i.e. g(u) = id(u) = u for all 
0< u< l1. An absolute risk intolerant agent is associated with a distortion 
function g such that g(u) > u, for all 0 < u < 1. And, similarly, an absolute 
risk tolerant agent has a distortion function g such that g(u) < u, for all 
0< u< l. This definition of risk attitude is in absolute terms in the sense 
that the relationship of ordering between g(u) and u must be fulfilled in the 
whole range [0, 1]. Note that these considerations lead to a more restrictive 
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Figure 5.4 Example of distortion functions with the same area 


definition of risk attitude than that provided by the aggregate risk attitude. 

The definition of the absolute risk attitude implies that the implicit attitude 
of an agent is invariant over the range of values. Yet, there are no reasons 
as to why the agent should have a unique risk attitude across the whole do- 
main. An agent’s attitude to risk is likely to differ in accordance with the 
interval of loss under consideration. The risk attitude implicit in frequently 
used risk measures is not invariant; this is the case, for instance, of the VaRq. 
When using the VaRg, a risk intolerant attitude is associated with the in- 
terval [1 — a, 1), but a risk tolerant attitude is associated with the interval 
(0, 1— a). Thus, an homogeneous risk attitude cannot be linked to the VaRq 
risk measure throughout the domain. 

Let us define a quotient function Qg from (0, 1] to R, based on the distor- 
tion function g associated with the risk measure, in order to characterize 
the local vision of risk. Let the function Qg be defined as the quotient be- 
tween the distortion function g and the identity function, Qg(u) = gu for 
all 0 < u < 1. The Qg allows the analysis of the agent’s perception of risk 
at any point in the survival probability distribution. This quotient function 
provides a function of survival probabilities, u, which describes the distor- 
tion factor applied by g at each u level. The quotient Qg is a quantifier of 
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the local risk tolerance of the agent at any point. The quotient value repre- 
sents the relative risk attitude of the decision maker compared to that of an 
agent with a risk neutral attitude who is confident of the survival probability. 
An agent is risk neutral, risk tolerant or risk intolerant at point u if Qg (u) is 
equal to, lower or higher than one, respectively. 

A graphical analysis of the quotient function is proposed to investigate the 
risk attitude of the agent at any point in the survival distribution function 
when using a certain risk measure. An interesting characteristic is that the 
quotient function is bounded. Since the quotient function computes the 
ratio between the distorted survival probability and the survival probability, 
so 4 is the maximum value attainable by this quotient function. In fact, 
the maximum risk intolerance frontier at the survival value equal to u is 
achieved when Qg (u) = 7 (upper bound). Note that Qg takes non-negative 
values. The maximum local risk tolerance frontier is achieved when Q¢(u) 
is equal to zero (lower bound). In addition, when the agent does not distort 
the survival probabilities, Qg (u) takes value equal to 1 (local risk neutrality 
line). 

In Figure 5.5 bounds of the Qg are plotted. Upper and lower bounds are 
represented in the Figure 5.5 by a solid line. The local risk neutrality line 
is plotted by a dotted line. An agent’s risk intolerance (tolerance) attitude 
emerges at point u when the quotient function is bigger (smaller) than one. 
As the quotient function is bounded, we can deduce at any distorted survival 
value how far the value is from the maximum risk intolerance/tolerance. 
The evaluation of the local risk appetite pattern ofa manager using the VaRq 
and TVaRg is investigated. In Figure 5.6 the quotient functions associated 
with the VaRq and TVaRg are displayed, Qy, and Qy, respectively. 

If we focus our attention on the quotient function associated with the VaR, 
Qy,» it can be seen that a radical risk attitude is implicit in the interval 
[1—a, 1), shifting to the opposite extreme in the interval (0, 1— œ). Indeed, a 
maximum risk intolerance is involved in [1—a, 1) and a maximum risk toler- 
ance attitude is involved in (0, 1 — æ). Some similarities are found when the 
quotient function associated with the TVaR is examined, Qy,. Two ranges 
involving a different risk attitude are also distinguished. Maximum risk in- 
tolerance is involved in the interval [1—a@, 1) anda constant (non-boundary) 
risk intolerance attitude is involved in (0,1 — æ). In that interval, the quo- 
tient function value is farther to the maximum as closer is to zero the sur- 
vival probability. Unlike the VaRq, an absolute risk intolerance attitude is 
associated with the TVaRq because the quotient function is larger than one 
throughout the range (0, 1). 
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Figure 5.5 Bounds of the quotient function 


Figure 5.6 The quotient function of VaRg (left) and the quotient function of 
TVaRg (right). 


0 0 


l-a 1 l-a 1 


Note: Distortion functions only differ in terms of the interval [0,1 — œ). The quo- 


tient function of the mathematical expectation, Qia, is represented by a horizontal 
dotted line. 


naa 


The quotient function of the GlueVaR 
ticular shape for the Q, wie depends on "the values of the four parameters 


is not plotted because the par- 


that define the risk sigs ute However, as in the former two risk measures, 
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a maximum risk intolerance is involved in the interval [1 — a, 1) when the 
quotient function of the GlueVaR y” 
just one attitude can be involved in the range (0, 1 — æ). The high flexibility 
of GlueVaRg y allows multiple attitudes towards risk to be implicit in the 
range (0, 1— a), depending on the values of the remaining three parameters, 


p, wı and w2. 


is analyzed. However, more than 


In short, the quotient function Qg can be used to characterize the relative 
risk behavior of an agent at any point. The value of a quotient function at a 
particular point depends on the distortion function as well as on the origi- 
nal survival function. In other words, risk attitude in the quotient function 
is contemplated in the size of the distortion performed (the numerator) but 
also in the position in which this distortion is performed (the denominator). 
Note that the area under the quotient function provides similar information 
to that of the area under g, but expressed in terms of risk neutrality. Indeed, 
the area under the quotient function can be interpreted as the area under a 
weighted distortion function, where weights (1/ u) are given to distorted val- 
ues, i.e. g(u)- L, Following this interpretation, greater weights are assigned 
to distortion function values associated with lower survival values. The areas 
under the quotient function of VaRg and TVaRg are A (Qu) =-—In(1-a) 
and A (Qy,) = 1—In(1- a), respectively. Similarly, the area under the quo- 


W),W2 ; 


tient function of the GlueVaR Ba is equal to 


1+m(7—4] 


Evaluating the area under the quotient function may be useful when analyz- 


+W2—-In(1-a). 


A (Quer = W] 


ing the aggregate risk behavior in situations in which values of the distortion 
function are weighted, indicating that risk intolerance is negatively associ- 
ated with the size of the survival values. Thus, the area under the quotient 
function can be interpreted as a weighted quantifier of the aggregate risk 
attitude, where an area equal to one indicates aggregate risk neutrality, an 
area larger than one indicates aggregate risk intolerance and an area lower 
than one indicates aggregate risk tolerance. 


Example 5.1 (Obtaining risk attitudes of a RVaR risk measure). Taking into 
account the arguments presented in this chapter, the following corollaries 
of equivalence (4.13) proven in Section 4.3 of Chapter 4 may be stated: 
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* The aggregate risk attitude of RVaRq,g can be obtained as the area under 
the distortion function plotted in Figure 4.1: 


2-(a+b)—-(1-a) 230 E T 1—b+2(a+b)-2 
2 2 
_ 2a+b-1 
7 2 
= (5a) 


* The local risk attitude of RVaRg, g is given by the following quotient func- 


tion: 
0 if we [0,a] 


— if ue (a,a+ p] 


Qrvar (u) = (5.2) 


1 
— if ue (æ+ f,1 
F if we (a+ B, 1) 


A numerical illustration of the results listed before is provided. Let us eval- 
uate a RVaRo,.1%,5% risk measure. It combines VaR at the 99.9% and TVaR 
at the 95%. Parameters are a = 0.1% and f = 5%. From expression (5.1) 
the aggregate risk behavior of RVaRo.1%,5% is a = 1948% = 97.4%. 


From expression (5.2), the specific risk attitude of RVaRo.1%,5% is described 
by the following quotient function: 


0 if ue [0,0.1%] 
1000u-1 . 

Qavar(W) =4 zou if u € (0.1%, 5.1%] (5.3) 
1 


— if ue (5.1%, 1] 
u 


The equivalent GlueVaR risk measure to RVaRo.1%,5% is the one with pa- 
rameters a = 99.9%, a + b — 1 = 99.9% + 95% — 1 = 94.9%, hı = 0 and 
hz = 1. In other words, 


2 0,1 
RVaRo.1%,5% = GlueVaRg9 9% 94.9%" 


Given that a general expression for the overall or aggregate risk attitude be- 


hind a GlueVaR5',” risk measure is the following 


_ (14+B-2@) ` (1-@) _ 
Oi — 5 ee 5 +@ 


76 RISK QUANTIFICATION AND ALLOCATION METHODS FOR PRACTITIONERS 


oe ; 0,1 ; 
let us check if, using this formula for GlueVaRgg 9%,94.9% the 97.4% is re- 


covered. From expressions (4.9): 


@ = a+ b- 1 = 94.9% 
B= a = 99.9% 
l-a 0.1% 2 
© =W,= = 2 — (5.4) 
l-b 5% 100 
p 2-(a+b) 194.9% 194.9 
@o = Wo = ———— = = ———. 
E SE 1-b 5% 5 
Then 
2 (14+99.9% -2 -94.9% 5.1% / 1- 94.9% 
— — | —— j —— | +94.9% 
100 2 5% 2 
2 (10.1% 5.1 (5.1% 
+ + 94.9% 
100 2 5 2 


—0.101% + 2.601% + 94.9% = 2.5% + 94.9% = 97.4%. 


5.2 Application of risk assessment in a scenario 
involving catastrophic losses 


This section illustrates how the above findings can be applied in character- 
izing underlying risk attitudes. It is devised to highlight situations in which 
the implicit risk attitude linked to the VaR is unable to detect changes in 
potential catastrophic losses. We argue that the use of equivalent GlueVaR 
risk measures can be helpful in overcoming this drawback. 

Suppose that the VaR with a confidence level a = 99.5% is required to as- 
sess the regulatory capital under some regulatory framework. Note that the 
selection of the confidence level involves a trade-off between protection and 
competitiveness. The level of the protection could be reduced with low con- 
fidence levels. An increase in the confidence levels could involve higher eco- 
nomic reserves and, therefore, the protection would rise; however, this could 
also affect the competitiveness. 

Risk managers may dislike using the VaR as a risk measure, because of the 
lack of risk-based information it provides on catastrophic losses. Indeed, 
two firms with marked differences in the sum of their potential losses in ad- 
verse scenarios may report the same risk value, even though they are not 
exposed to the same level of risk. As such, their disparities would go un- 
observed by decision makers. Moreover, the lack of subadditivity may well 
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constitute another drawback. Alternatives to VaRgg.5% that take into ac- 
count catastrophic losses can be considered by risk managers. Traditional 
approaches frequently lead to severely higher economic reserves. Managers 
need to find a risk measure that generates similar economic reserves than 
VaRgg_.5% for the overall risk faced by the insurance company and, addition- 
ally, they would like that the alternative risk measure provides risk-based 
information on catastrophic losses and that, hopefully, it satisfies appealing 
subadditivity properties. 


5.2.1 Calibration of GlueVaR parameters 


Our goal is to find the set of GlueVaR risk measures that return the same 
risk value that the VaRgg.5y in a particular context. So, we need to find the 
parameter values that define the GlueVaRg 
required in calibrating GlueVaR risk measures are described here. The crite- 


risk measures. All the steps 


rion followed in the calibration procedure is the need to obtain the same risk 
measure value with the GlueVaR risk measures as the one obtained with 
the VaRg9.5%. Moreover, the selection of the risk measure is restricted to 
the subfamily of GlueVaR candidates that may satisfy that their distortion 
function is concave in [0, 1 — æ). The strategy for calibrating the parameters 
is as follows: 


- Minimum and maximum admissible values of the a and p confidence 
levels have to be determined, @min and Bmax. 


- Let us assume that Z random variable represents the overall risk. A set of 
d x d constrained optimization problems is defined at this step: 


Pij: min |GlueVaR,""7"(Z) — VaRo9.5%(Z)I, 


@1,W2 
0<a) <1, 
subjectto 4 0< w2, 
wı +w <1 


Sy i-l 
where l, J = 1,...,d, Qi = Amin + Faq (Pmax Amin) and Bj =ajt 


j—1 
— (Bmax — @;). Flexibility rises with the number of partitions d, as do 
computational costs. Constraints are fixed to guarantee that the distor- 


tion function of the GlueVaR is concave in [0, 1 — æi). 


- An optimization algorithm should be used to solve this set of problems. 
If Pj, j+ represents the problem for which the minimum value of the ob- 
jective function is reached and (w{,«5) is the associated solution, then 
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is found with its distortion function concave in [0, 1 — 


pria 
w1 ,W2 
jeje 


a GlueVaR B 
aj) and gives similar risk values to those obtained with VaRg9.5% when 
applied to the overall risk of the company. P; į problems may not have 
solutions. Were this to be the case, then the optimization criteria would 


have to be revised, including a lower @ min, ahigher Bmax and/or a larger d. 


- More than one GlueVaR solution is frequently found. Alternative com- 
binations of parameter values return the same objective function value, 
or a value that differs insignificantly. In this situation, solutions could be 
ranked in accordance with the underlying risk attitude involved. Here, 
we propose ranking the solutions based on the value of the area under 
the distortion function associated with each optimal risk measure. With 
this goal in mind, degrees of orness are computed for (multiple) optimal 


WW: z A : 
GlueVaR Bis a Solutions. Two particular GlueVaR risk measures among 
E ie dasa 5 


the set of solutions are of special interest: 


Lower-limit solution. Selection of the GlueVaR risk measure with the as- 
sociated minimum area under the distortion function; 


Upper-limit solution. Selection of the GlueVaR risk measure with the as- 
sociated maximum area under the distortion function. 


In other words, boundaries of the area size under distortion functions are 
detected. Optimal GlueVaR risk measures linked to boundaries reflect the 
extreme risk attitudes of agents when the random variable Z is analyzed. 


5.2.2 Data and Results 


We are going to use the dataset used in previous chapters. It contains Xj, 
Xə and X. Total claim costs are the sum of the three random variables, 
Z = Xı + Xə + X3. So the aggregate risk faced by the insurer is the sum of 
the three random variables X;, i = 1,2,3. We assume that the insurer uses 
the VaRgg.5% as its risk measure. 

Before dealing with the calibration of the GlueVaR risk measures, we first 
compute the VaRg9.5%(Z) and its associated area under its distortion func- 
tion. The risk measure value is equal to VaRgg,5%(Z) = 51.05 and the area 
under its distortion function is equal to 0.995. Let us now focus on the strat- 
egy used to calibrate the GlueVaR parameters. The following steps are per- 
formed to obtain GlueVaR risk measures that are comparable to the VaRgg.5%(Z): 


a) the minimum and maximum values of confidence levels are fixed at 90% 
and 99.9%, i.e. @min = 90% and Bmax = 99.9%; 
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b) the number of partitions is stipulated in d = 25, so we deal with 625 
optimization problems; 


c) the empirical distribution function of total claim costs is used for the risk 
quantification, and, finally; 


d) the outcome choice of the GlueVaR solutions are obtained using con- 
strOptim function from rootSolve library in R. 


A more complex calibration problem involving a modified random variable 
including catastrophic losses can be found in Belles-Sampera et al. [2016c]. 
We obtained a set of optimal GlueVaR risk measures that give the same risk 
value as the VaRgg.5% in this specific context. Thus, 192 optimal solutions 
were found. Once a set of GlueVaR risk measures has been obtained as feasi- 
ble solutions, the areas under the distortion functions linked to each Glue- 
VaR were computed to characterize the respective underlying aggregated 
risk attitude. The boundary values and the associated GlueVaR risk mea- 
sures were identified. We should emphasize that the maximum area was 
equal to the area of the VaRgg.5%. In fact, the optimal GlueVaRg y” solution 
with the highest area size was the GlueVaR with parameters a = 99.5%, p = 
99.9% and w1 = w2 = 0, and it holds that GlueVaR§) go 99 5% = VaRoo.5% 
(see expression (3.4)). In other words, given a certain risk value, the VaRg 
is the GlueVaR risk measure that presents the highest area under the asso- 
ciated distortion function of all the GlueVaR risk measures that return this 
value. Recall that the distortion function associated with the VaRg assigns 
one to survival values higher than (1 — œ) and zero to the rest. 

The minimum area under the distortion function and the associated Glue- 
VaR risk measure for the original dataset are reported in Table 5.1. Informa- 
tion about the underlying aggregate risk attitude of the agent can be inferred 
from the minimum area. Table 5.1 shows that, for this dataset, there exists 
an optimal GlueVaR risk measure for which the area of the associated distor- 
tion function is approximately 0.949. Thus, this GlueVaR risk measure gives 
the same value as that given by VaRgg.5y% when applied to Z, but, in aggre- 
gate terms, it involves a more moderate distortion of the original survival 
distribution function and, consequently, less aggregate risk intolerance. 
The area under the distortion function should be complemented with the 
examination of the quotient function which allows the relative risk attitude 
at any point of the survival distribution to be analyzed. The quotient func- 
tions of risk measures associated with boundary areas in both scenarios are 
examined. All the quotient functions analyzed are located in the upper risk- 
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Table 5.1 Optimal GlueVaR risk measure 


Original dataset 
0.900 
0.999 
w 0.483 
w2 1.035- 107° 
w3 0.516 
Area under the 
distortion function 0.949 


Note: Parameter values of the associated GlueVaR risk measure equivalent to 
VaRgg9.5% and minimum area under the distortion function. 


tolerance frontier in the range [0.10,1). For ease of comparison, the quo- 
tient functions are rescaled and their left-tails are shown only in the range 
[0, 0.10] in Figure 5.7. 


Notable differences can be observed in the relative risk attitudes locally im- 
plicit in the left-tail of the quotient functions. Let us first examine the quo- 
tient function of the GlueVaR risk measure that presents the maximum de- 
gree of orness (left), which is the same quotient function associated with 
the VaRgg,5%. The agent is most risk intolerant at any point of the interval 
[0.5%, 1) and maximum risk tolerant at (0,0.5%). This means, the quotient 
function is located in the upper frontier at [0.5%, 1) and in the lower frontier 
at (0,0.5%). When the GlueVaR risk measure that presents the minimum 
area is analyzed (right), the patterns of the left-tails of the quotient func- 
tions are undoubtedly different. An interesting finding is that the Qg (u) is 
not located within the boundaries at any point of the interval (0,0.10). This 
means, the risk intolerant attitude is not maximized in the range [0.5%, 0.10) 
but, on the contrary, the agent is more risk intolerant to catastrophic losses 
at (0,0.5%) than when using VaRgg.5%. 
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Figure 5.7 Quotient functions of optimal solutions 
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Note: Maximum (left) and minimum (right) areas under distortion functions of op- 
timal GlueVaR risk measures. Dashed curve indicates the upper-bound quotient 
function curve, i.e. 4 forallO<us<l. 


5-3 GlueVaR to reflect risk attitudes 


Distortion risk measures are widely used for assessing risk in a range of dif- 
ferent contexts. As we have seen, the risk value obtained from such risk 
measures depends basically on the characteristics of the random variable, 
which are captured by the survival distribution function, and the associated 
distortion function. These two elements can be disaggregated and, here, our 
attention has focused on an analysis of the distortion function, which can 
be understood as a weighting function of survival probabilities. Thus, any 
risk attitude implicit in a distortion risk measure is to some extent attached 
to its distortion function. 


The analysis of the risk attitude implicit in the Value at Risk illustrates that it 
is not sensitive to changes involving riskier scenarios. The reason for this is 
that the risk measure presents two extreme risk attitudes, i.e. maximum risk 
resistance in [1— a, 1) and maximum risk intolerance in (0, 1 — æ). Here, we 
have shown that additional risk information may be obtained using com- 
parable GlueVaR risk measures that are calibrated to report the same risk 
value as that of the VaRg. The calibration procedure of GlueVaR parame- 
ters was not the primary focus of this chapter and it was not our intention 
to cover all the possible calibration criteria. However, two final remarks are 
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worth recording. First, the procedure proposed depends necessarily on the 
risk measure of reference and also on the random variable. As such, the 
set of comparable GlueVaR risk measures differs when the random variable 
changes. This is not an unbridgeable drawback when requesting to the su- 
pervisory authorities authorization for changing the risk measure to a Glue- 
VaR one, if decision makers are able to justify the GlueVaR selection process. 
For example, these measures may be used to analyze variations from one 
year to next in the implicit risk attitude of boundary cases when the VaRg is 
applied to assess the annual risk. Second, the VaRg was chosen as the risk 
measure of reference because of its application in practice, but the calibra- 
tion strategy of the GlueVaR risk measures could easily be adapted to other 
risk measurement problems. 


5-4 Exercises 


11/30,2/3 
85%,50% ” 


What do you can say in terms 


1. Compare the area under the distortion function of the GlueVaR, 


GlueVaR? d the Glue VaR pose so% . 


85%,50% 0 
of aggregate risk attitude associated to these risk measures? 

Hint: The heights (hı, h2) equal to (11/30, 2/3), (0,1), and (1/20, 1/8) 
correspond to (w1 = 1/3, w2 = 1/3), (wı = —1/9, w2 = 10/9) and (w; = 
1/24, w2 = 1/12), respectively. 


2. Plot the quotient functions of previous GlueVaR risk measures. Discuss 
what additional information (if there are) in terms of local risk attitude is 
provided when the quotient function of these risk measures is analyzed. 


3. Compute the area under the distortion functions of the RVaRı%,5% and 
RVaR5%,1%- 


- What do you can say in terms of aggregate risk attitude associated to 
these risk measures? 

- Discuss their quotient functions in terms of the implicit local risk atti- 
tude. 


4. Analyze the area under the distortion function and the quotient function 
of the Wang-Transform risk measure [Wang, 2002]. Discuss the relation- 
ship between @ and the underlying risk attitude. 


Hint: The distortion function of the Wang-Transform risk measure is de- 
fined as g(u) = ®[®!(u) — A], where u = S(x) and A = $~} (a) with 
the security level a. 


PART Il 


CAPITAL ALLOCATION PROBLEMS 


6 An overview on capital 
allocation problems 


Capital allocation problems of insurance and financial institutions arise when 
a management unit must distribute an amount of resources among different 
business units. These resources may be the aggregate cost faced by the com- 
pany, its solvency capital requirement or the total variable economic com- 
pensation to be shared across business units, among other examples. This 
kind of problems are frequent and relevant from an Enterprise Risk Manage- 
ment (ERM) perspective, mainly if the risk that each business unit faces is, 
somehow, taken into account for the final allocation. Sometimes the capital 
allocation is merely notional, as pointed out in Dhaene et al. [2012b]. This 
does not diminish the importance of studying these problems at all. The al- 
location information may be useful to conduct different business analyses 
in order to improve the risk management of the company. 

Main concepts and notations regarding capital allocation problems are in- 
troduced in the next section. These are the building blocks on which the 
rest of the discussion is based and represent a necessary starting point to 
go further in next chapters. The overview on capital allocation problems 
is completed with a description of some particularly interesting solutions 
and with a list of properties that particular solutions to these problems may 
satisfy. 


6.1 Main concepts and notation 


In general terms, a capital allocation problem may be understood in the fol- 
lowing way: 


“An amount K > 0 of monetary units has to be distributed across n € N 
agents, and the allocation must be a full allocation.” 
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Described in such a way, capital allocation problems can be understood as 
disaggregation problems. Several comments must be made in relation to 
capital allocation problems. First of all, on the risk management framework 
in which this kind of problems arises. These problems are strongly related to 
the fact that risk managers from the insurance and banking industries must 
determine, at different levels of granularity, the contributions of agents to 
the risk-based regulatory capital required to companies. In that sense, the 
concept of agent must be understood in a broad way: it may be a commer- 
cial agent, a business unit, a branch of the overall business or even a par- 
ticular guarantee included in a set of contracts. Nonetheless, it has to be 
noted that similar risk management problems are faced by asset manage- 
ment firms when planning investment strategies or when assessing perfor- 
mance of their investment portfolios. In such contexts it is more usual to re- 
fer to these problems as portfolio risk attribution or risk budgeting problems 
[see, for instance, Grégoire, 2007; Rahl, 2012]. 

Secondly, it is important to list the main elements that play a role in a cap- 
ital allocation problem. Capital allocation problems may be described by 
means of the following elements: 


: The capital K > 0 to be distributed; 
: The agents, indexed by i = 1,..., n; 
- Random variables linked to each agent, {X;}i=1,....n3 


- Functions fj, i = 1,..., n, used to simplify the information provided by 
each X;; 


- A distribution criterion; 


: Capitals K;, i= 1,..., n, assigned to each agent as a solution to the prob- 
lem; 


: The goal that is pursued with the allocation. Some examples are cost of 
risk allocation, reward to riskless allocation or reward on risk and return 
(risk&return) allocation. 


In particular, a solution ofa capital allocation problem is the set of n capitals 
{K;}i=1,...,n allocated to each agent, where capital K; is the amount of capital 
assigned to the ith agent. It is usually required that the solution satisfies 
the full allocation property. It happens when the set of capitals adds up to 
K, that is, 2", Ki = K. The capital amount assigned to the ith agent K; is 
related to the risk X; faced by that agent. Random variables Xj, i = 1,..., n, 
are frequently representing ith agent’s losses. 
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A solution to a capital allocation problem is called a capital allocation prin- 
ciple. One of the fundamental elements characterizing a capital allocation 
principle is the distribution criterion that drives the allocation. Proportional 
allocation criteria are such that each capital Kj, i = 1,...,n, may be ex- 
pressed as the product of capital K times a proportion of the form 


fi(Xi) 
2 fiX 
j=1 


where f; functions simplify all the information provided by X;. Therefore, 
the general expression for a proportional allocation principle is 
qe i=1,...,7. (6.1) 
2 fiX) 
j=1 


Frequently, functions f; are risk measures or partial contributions to the 
value that a risk measure assigns to the whole random loss understood as 
SSA oo Xj. If fi is a risk measure p, the proportional allocation principle 
is classified as a stand-alone proportional allocation principle. On the other 
hand, when dealing with {fj} ;=1,..... which represent partial contributions 
to p(S) for a given risk measure p, the proportional allocation principle is 
based on marginal or partial contributions. The name is inherited by the fact 
that expression p(S) = Li fj (Xj) holds. In such those cases, notation 
fi (Xi) = p(X; | S) is going to be used and, therefore, the general expression 
for proportional allocation principles based on partial contributions is 


=K: p(X; lS) 
p(S) 


Among proportional capital allocation principles, the main difference be- 


n t=l1,...,n. (6.2) 


tween stand-alone principles and the ones based on partial contributions 
is related to diversification effects. Stand-alone principles do not take into 
account neither benefits nor penalizations on risk of each ith agent due to 
the fact that the agent belongs to a set of agents, while principles based on 
partial contributions do. That is, stand-alone principles do not take into 
account dependencies between risks and partial contributions based prin- 
ciples take into account that risks are interconnected. 

Non-proportional allocation principles are such that an expression like (6.1) 
for each of the assigned capitals K;, i = 1,..., n, cannot be achieved. An ex- 
ample of non-proportional principle is the excess based allocation principle 
shown in Section 6.3.3. 
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A significant number of principles can be included in the framework pro- 
vided by Dhaene etal. [2012b]. For instance, when using the so-called quad- 
ratic optimization criterion, principles like 


n 
Ki = pi(Xi) + vi: |K- } p;p (6.3) 
j=l 
where p;, i = 1,..., n, are risk measures and v; are weights such that 27 Vj 


= 1. Ifall v; are equal to p;(X;i)/ (Dye 1 P(X) ), then it is a proportional 
allocation principle. To obtain non- proportional allocation principles, at 
least one of the v;, say Vip, must be not equal to p;(X;)/ be 1 Pj (Xp) ). In 
other words, there must exist an ig € {1,..., n} such that 


Last but not least, a major issue is the goal pursued with the allocation. We 
here propose three different goals: cost of risk, reward to riskless and reward 
on risk and return. Other alternative goals may be considered and they will 
depend on the opinion of decision makers (risk managers, regulators, etc.). 
The aim ofa capital allocation problem with a cost of risk goal is to distribute 
the cost among the agents by taking into account some measure of the risk 
faced by each one of them. An example of such a cost of risk allocation is 
the disaggregation of the Solvency Capital Requirement (SCR) of the whole 
business of an European insurance company under the Solvency II regime 
among its lines of business. 

Assume that a management team wants to stimulate a risk averse attitude 
among the business units it has in charge, this management team may adopt 
a compensation scheme based on the following idea: the riskier the business 
unit is, the lesser the reward it receives. In such a situation, a capital allo- 
cation problem with a reward to riskless objective is conducted. This kind 
of problems rarely appears in practice. But, if we try to think in an example 
of such a capital allocation problem, it could appear when there is the re- 
quest of notionally distributing the contribution of each agent to the overall 
diversification benefit, where only there is information about a final cost of 
risk allocation and the overall diversification benefit. That is, where there is 
not information about each individual diversification benefit. 

A much more frequent problem faced by managers in practice is to allo- 
cate capital under a reward on risk and return criterion, in order to better re- 
ward those agents whose trade-off between return obtained and risk faced 
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is higher. In this context, a return-on-risk measure seems to be the natural 
choice of functions {f;};=1,...,n in order to assign rewards under the reward 
on risk and return perspective. In next sections some examples related to 
this type of allocation are going to be provided. 

A principle is denoted in this book by K= (Kı, K2,..., Kn). An abuse of 
notation is made because K is used both to denote the vector K whose 
components are K; and the capital to be distributed among agents. Given 
an (absolute) capital allocation principle K with K = > rat Kj, its relative 
counterpart is defined as the n dimensional vector X, whose components 
are x; = K;/K and satisfy that A x; = 1. If there is no room for con- 
fusion, upper-case letters mean absolute principles while lower-case letters 
mean relative ones. This notation is used in next chapters. 


6.2 Properties of capital allocation principles 


As in the case of risk measures, capital allocations have been often stud- 
ied from an axiomatic point of view. Denault [2001] defines a set of axioms 
that capital allocations should satisfy. Allocations satisfying some of these 
axioms’ were called coherent allocations therein, in line with the idea of co- 
herent risk measures previously introduced by Artzner et al. [1999]. Let Xi, 
i=1,...,n, be the set of risks and S = X] + +-+- + Xn their sum. Consider a 
risk measure p(S) and capital allocation principles for which K = p(S) and 
Ki, i = 1,...,n are partial contributions K; = p(X; | S) such that p(S) = 


n 
X p(X; | S). The properties that those capital allocation principles should 
i=] 


satisfy are: 


- Consistency (or riskless allocation) 
The capital allocated to a risk that has no uncertainty is equal to the cer- 
tain risk. 
If X; = k for any constant k € R then K; = p(X; | S) = k. 


: Full allocation 


The capital for the overall risk is split into capitals for all individual risks, 
that is 


n 
p(a,X, +-+++ anXn) = > p(aiXi |S), aie R*. 
i=] 


1 No undercut, symmetry and riskless allocation (or consistency). 
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If p satisfies the positive homogeneity property, then this is equivalent to 


n 
p(a,X1 +--+ anXn) = > aip(Xi |S). 
i=1 


Notice that for a; = 1, for all i = 1,..., n, 


of x] =} p(Xil9). 
i=l 


i=l 


- No undercut 
The capital allocation for any decomposition of total risk undercut is not 
higher than the capital amount that it would be allocated as a separated 
portfolio. 
For any subset of N, AC N, where N = {1,..., n}, it is satisfied that 


F Ki=F ocx 9<0(E xi). 


ie A icA icA 


: No diversification 
If there are not diversification benefits for the agents when considered 
together in terms of risk (i.e. if 2? (Xi) = e(S)) then the capital allo- 
cation principle becomes really simple: the stand-alone risk of each agent 
is the amount allocated to it. If 0", 0(Xi) = e(S) then K; = p(X;) forall 
i=l,...,n. 


* Symmetry 
For two identical risks, X; = Xj, i # j, the allocated capital must be the 
same p(X; | S) = p(X; |S). 


* Continuity 
Infinitesimal variations on the whole risk have no impact on risk alloca- 
tions, 


lim p(X |S+h) = p(X; |S). 


This is not a complete list of properties. Other properties as translation in- 
variance, scale invariance or monotonicity with respect to a concordance or- 
dering are often asked to capital allocation principles. The interested reader 
is referred, for instance, to van Gulick et al. [2012]. 
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6.3 Review of some principles 


A collection of particular principles is included in this section. Three partial 
contribution based capital allocation principles are defined and one non- 
proportional capital allocation principle. This section does not pursue to 
be a deep review of principles found in the literature. The principles dis- 
cussed in this section have been selected for exposition purposes. The at- 
tention has been paid to specific issues of each capital allocation principle, 
so a non-homogeneous extension is found in the discussion of each princi- 


ple. 


6.3.1 The gradient allocation principle 


This principle is also known as Euler allocation principle [McNeil et al., 2005] 
or, from a game-theoretic perspective, as Aumann-Shapley allocation prin- 
ciple [Denault, 2001]. According to Tasche [1999, 2004, 2007] capital allo- 
cation principles based on the gradient are the most appropriate allocation 
principles to deal with risk adjusted returns. Since reward on risk and re- 
turn allocations could be specially useful in sound ERM systems, the key 
elements of the gradient allocation principle and its usefulness as reward 
on risk and return allocations are discussed in detail in this section. 

The basic idea that must be remarked is that the gradient allocation princi- 
ple takes advantage of the Euler’s theorem on homogeneous functions ap- 
plied to positively homogeneous risk measures. The definition of homoge- 
neous functions and the Euler's theorem are as follows. 


Definition 6.1 (Homogeneous function of degree r). Let f be a function 
from R” to R, n> 0. 


f ishomogeneous of degree r @ VAER, f(A-i) =A": fl). 


Theorem 6.1 (Euler’s theorem on homogeneous functions). Let f : R” — 
R be a differentiable function on R”. Then, 


z 0 
f is an homogeneous function of degree r > L Ui: of (M=r- fù). 
i=l i 


It has to be noted that a differentiable function f defined from R” to R has 
ð ð ô 
a gradient equal to V f (ú) = (= (ü), oe CD) rsa st 


n 
the right-hand side of the equivalence in Theorem 6.1 can be also written as 
(ù, V f (Ù) = r- f (Ù), where ( , ) stands for the interior product in R”. 


(Ġ) | and, therefore, 
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Consider now a positively homogeneous risk measure p. This means that 
p(A- X) = A- p(X) for all A > 0 and for all X € T, as we have seen in Ta- 
ble 1.2 of Chapter 1. Now, given a random vector X= (Xi, Xo,..., Xn) ET”, 
consider the following function fẹ as well: 


n PPS pos 


SA 


ti s(ii) =} uiXi > p(s) = aps ux =fg@ (6.4) 
i=1 


i=1 


(R+) 


Taking into account Definition 6.1 restricted to (R+) m considering that p is 


n 
a positively homogeneous risk measure, and noting that )} u;X; €T for all 
i= 


ùe (R*) "if X € T”, thenitis deduced that fz isan homogeneous function of 
degree r = 1. In such a case, applying Theorem 6.1 the following expression 
holds: 


fz® = > ui- 


If # = (1,1,..., 1) then the sum r Ui Xi is the sum of all the components 
of the ae vector X. From this point forward, this sum is denoted as S, 
soS=))" | Xj. Ifthe following simplified notation is used 


= (ù, V f (ü). (6.5) 


ô fz 
2o- Ju, Dl (1,1,...,1) » 


then expression (6.5) becomes 


n 
p(S)=)) 5), (6.6) 
i=1 Ou; 
which is the formula usually related to the underlying idea of the Euler allo- 
cation principle. 


Example 6.1. Let W; and W2 be two independent random variables identi- 
cally uae as standard Normal distributions. Consider X; = W; and 
X2= 5 Wy + 8 - W2. Obviously, X; is distributed as a standard Normal dis- 
eabiton and it is easy to check that X? too, although Xj and X2 are not in- 
dependent but there is a linear correlation of 50% between them. Consider 
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now two positive deterministic returns rı > 0 and r2 > 0 and two assets 
whose random returns can be represented by Yj = rı + Xı and Y? = r2+ X2. 
Investing in u > 0 units of the first asset and in u2 > 0 units of the second 
one a portfolio with return t(i/) = u1 - Yj + U2: Y2 is obtained. By construc- 
tion, the return of this portfolio is Normally distributed, with mean equal to 
U : F + U2: r2 and variance equal to 


VED) = E ([e(@ —E(e(@)]*) 


2 
3 
9405-10) Wi +S a | 


= (+m uz + us] E(WÊ) +=- us -E(W2) 
1 3 
rofs u) ua u em We) 


=u + u u+ Us. 


Starting from an initial portfolio of u? units of the first asset and us units of 
the second one, we are interested in allocating the risk faced by this portfolio 
(measured with the VaR risk measure at some confidence level) using the 
gradient allocation principle. Is this possible? 


First thing to be noticed is that positive values of returns t(i) are profits and 
not losses. This is important in order to calculate the risk faced by the port- 
folio, because if a € (0, 1) is the chosen confidence level then we are going to 
compute p(s(i)) = VaRg(—t(W)) or, in other words, the loss function that 
we are going to consider is s(t) = —t() (recall the discussion with respect 
to the ‘liability’ or the ‘asset’ side perspectives provided in Section 1.2.1). The 
second remark is more subtle and it is related to ensure that p(s(i)) is an 
homogeneous function of degree 1 on RÊ, because this is a necessary con- 
dition to be allowed to apply the Euler’s Theorem. In this case, there is no 
problem with p(s(i)), because p = VaRg is a positively homogeneous risk 
measure and s(i) = —U,- Yı — U2 - Y is linear in uy and uz, so homoge- 
neous of degree 1. But note that, for instance, if we change the actual Y by 
Yı = log(u, + 5) + X; then the associated (i) is not homogeneous of de- 
gree 1 and conditions to successfully apply the gradient allocation principle 
are not satisfied. 


Following the notation introduced in expression (6.4) let us compute fp (i) = 
VaR (s(Ġ)) and its gradient V fọ (ú). We have already shown that t(i) is 
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Normally distributed so, recalling Table 1.3 in Chapter 1: 


VaRq(s(u)) = — u1: rı — uz- r2 +O! (a) -4/ u? + U: U2 + uż. (6.7) 


Note that the previous expression is symmetric in u;, i = 1, 2. The gradient 
of VaRq (s(i)) is the vector with the following components 


(eur + u2): y UÑ + Ma U2 + U3 


ôVaRa (s(t) _ = 
uy 2° (uy + U: U2 + u5) (6.8) 
dVaRa(s(i)) “1, Ut 2 sta) yy UÑ + Un u2 + u3 
r2+@ (æ): 7 7 
ðuz 2- (Uy + U: U2 + U5) 


All the elements to derive the capital allocation based on the gradient prin- 


ciple are now available, so with respect to the portfolio consisting in u? units 
of the first asset and us units of the second one and taking into account ex- 


pression (6.5) the final allocation becomes the following: 


VaRa (-u? - Y; |—u®- Yı — u - Y2) 


(2 - (uf)? + u? - u8) - q/ (uP)? + u? - us + (u8)? 


=-uųł. l(a). 
ae Se 2- (u)? + u - u? + (u2)2) 


, 


(6.9) 
VaRa (-u}- Y2|—u?- Yı — u8 - Yo) 


(ul. ul +2- (u8)?)- \/ (ul)? +u? “uy + (u9)? 


Sa 0, om! 
meee E 2-((u9)2 + u? - u? + (u2)2) 


Let us complete the example with some numbers, similar to those that can 
be found in Buch et al. [2011]. Imagine that our starting portfolio consists in 
the combination of u? = = 1.5 units of the first asset and us = 1.7 units of the 
second one, and that the deterministic returns rı and r2 are rı = 46.2098% 
and r2 = 46.3798%, respectively. Additionally, let the confidence level q be 
equal to 99.97%, so ®~!(q@) = 3.4316. The random loss of the portfolio has 
the following expression s(ŭ0) =-1.5-Y,-1.7- Yə and 


E(s(i°)) = —1.5 - 46.2098% — 1.7 - 46.3798% = —148.1605%, 


V(s(H#°)) = (—1.5)? + (-1.5) - (-1.7) + (-1.7)? = 7.69, (6.10) 


VaRg9,97% (5(i°)) = —148.1605% + 3.4316 - V7.69 = 803.4554%, 
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so a really profitable return of 148.1605% is expected (negative losses mean 
gains) and negative returns will not be under —803.4554% in the 99.97% 
of situations. How may this potential risk be allocated to the assets? One 
possibility is to use the gradient allocation principle as derived in expression 
(6.9): 


VaRg9.97% (-1.5- Yı | -1.5- Yı —1.7- Y2) 
7.05 -2.7731 


= —69.3147% + 3.4316 - —————_ = 366.8941%, and 
2-7.69 (6.1) 
VaRg9.97%(—1.7- Yo | —1.5. Yı —1.7- Y2) 
8.33 - 2.7731 
= — 78.8457% + 3.4316- 5769 = 436.5613%. 


Note that, as it may be expected, 


VaRg9.97%(5(0°)) = VaRgo.97% (—1.5- Yı | s(0°)) 
+VaRo9.97%(—1.7 - Y2 |5(i°)) 


(803.4554% = 366.8941% + 436.5613%). 


Another way to interpret the allocation provided in expression (6.11) is that 
the 45.66% (366.8941%/803.4554%) of the risk is allocated to the first as- 
set and the 54.34% of the risk is allocated to the second one. 


Now, it is shown that the gradient allocation principle is the most appro- 
priate principle to deal with reward on risk and return. First, we consider 
a particular Return on Risk Adjusted Capital (RORAC) measure, and some 
concepts and notations taken from Tasche [2007]: 


Definition 6.2. The total RORAC of portfolio S = °?_, X; is defined by 


-E(S 
RORAC(S) = = 
p(S) 
where p is a risk measure and each random variable X;, i =1,..., n is such 


that its positive values represent losses. 


Definition 6.3. Given a portfolio S = 2j Xi and a set of contributions 
p(X; | S), i = 1,...,n, to the value of the risk of the portfolio measured by 
p, i.e. p(S) =O", p(X; | S), the portfolio-related RORAC of each random 
variable X; is defined by 


RORAC(X; | S) = real 
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Note that, in most of the situations, numerators in Definitions 6.2 and 6.3 are 
positive, because the mathematical expectations of S and Xj, i = 1,...,n, 
are negative: it may be assumed that ith business unit does not expect losses, 
so E(X;) < 0 due to the fact that positive values of X; mean losses. A sec- 
ond remark is that the Definition 6.3 depends on both portfolio S and par- 
tial contributions to p(S). Bearing these two previous definitions in mind, 
let us now present the RORAC compatibility definition provided by Tasche 
[2007]: 


Definition 6.4 (RORAC compatible risk contributions). Risk contributions 
p(X; | S),i = 1,...,m are RORAC compatible if there are some €; > 0,i = 
1,..., n such that 


RORAC(X; | S) > RORAC(S) => 


RORAC(S + hX;) > RORAC(S) forall O<h<e; 


According to the definition of RORAC compatibility, if the partial risk and 
return performance of ith agent given by Definition 6.3 is greater than the 
risk and return performance of the overall portfolio given by Definition 6.2, 
then the risk and return performance of the overall portfolio is improved by 
slightly increasing the position of ith agent in the portfolio. In other words, 
if some RORAC(X; | S) is greater than the RORAC(S) and the contribution 
p(X; | S) is RORAC compatible, then the position on ith agent should be 
increased in order to improve the overall performance of the portfolio. 
Assuming that it is possible to slightly increase the position of ith agent in 
the portfolio, going from X; to X;- (1+ h) with h € (0,e;), the necessary 
condition expressed in Definition 6.4 can be understood as equivalent to 
the following one 


ORORAC ž , 
6M) \a=0,1 


>0, 6. 
Bu, 1) (6.12) 


simply by computing 
1 
lim h [RORAC(S + hX;) — RORAC(S)]. 


Taking advantage of expression (6.12) the RORAC compatibility of the gra- 
dient allocation principle can be proved. 
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ôf- 

Proposition 6.1. Suppose that p(s(u)) and 2 Gy for alli = 1,...,n, are 
i 

strictly positive. A gradient allocation principle K € R” of the form K; = K- 

p(Xi S) 


p(S) 
is such that all the risk contributions are RORAC Ris es 


where risk contributions are p(X; | S) = 5) foralli=1,. 


Proof. Let us show that expression (6.12) holds for each i = 1,..., n: 
-E(rn Tee i -X;) 
p(2}-1u:X;) 
Du Z [EE ux) o (E x) 
(Ej: x] 
[-E(EF Xi): xP [Ej 4-%)) 


[Eu] 
2 [e(z u x) ee- em a 


ORORAC ð 


P ——— (stu o= 


[p(s(@)]* 
E(X;) - p(s(@) + [E(s(@)] - an 


| 


[e60]? 
As long as the denominator of the previous expression is always positive, 


ðRORAC E: 
then it is deduced that “au (s(ú)) > 0 if and only if —E(X;) - p(s(w)) + 


i 


ô 
E(s(å))- oli (i) > 0. Consider that both p(s(Ġ)) an 


i 
positive: these conditions may usually hold, because of Daie with risk 


values or risk contributions of a portfolio of risky positions. But note that, in 
fact, these conditions have been required as hypotheses in the proposition. 
Being this the case, this last expression may be written as 


-E(X;) _ —E(s(ŭ)) 


fe. PEO) ` 
“IX iy 
Ou; 


Moreover, when restricted to % = (1,1,..., 1) this last expression provides 
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the following information: 


ORORAC 
au —— (5) a= 


EK) _ -E(S) -E(X;) _ -E(S) 
> <> > 
dp p(S) p(Xi1S)~ p(S) 


1) > 0, ifand only if 


peony 


| 


<=> RORAC(X; | S) > RORAC(S). O 


Example 6.2. Let us continue with the example shown previously in this 
chapter, in order to illustrate RORAC calculations and to check that, when 
using the gradient allocation principle, the RORAC compatibility is satisfied. 
First of all, let us compute the total RORAC of the portfolio (recall expres- 
sion 6.2) and the portfolio-related RORAC of each asset in the portfolio (as 
defined in expression 6.3): 


RORAC(s(#°)) = 
u? "ri + us "T2 


—u? - rı — u? - r2 + ©! (q) - (ud)? + ut. u? + (ud)? 


RORAC(-u? - Y; | 8(i°))) = 
u? “Ti 


(2- (ud)? + u9- ud) UHE (613) 


2- (u)? + u? - ud + (u8)?) 


-ul -rı +07! (a)- 


RORAC(-u9 - Y» | (2°) = ; 
Uy *T2 


(U - U8 + 2- (U8)?) -y (U9)? + u -u8 + (U9)? 


—ul. o-l 
nee ee 2- (uo)? + u? - u? + (u2)2) 


Considering the aforementioned values of u9, u, rı, r2, and q, the previous 
expressions lead to the following results: 


1.5 - 46.2098% + 1.7 - 46.3798% 


— 1.5 - 46.2098% — 1.7 - 46.3798% + 3.4316 - 2.7731 
= 148.1605%/803.4554% 


= 18.4404%, 


RORAC(s(i°)) = 
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1.5-46.2098% 


RORAC(-1.5- Yj | s(@))) = 


7.05- 2.7731 
—1.5 - 46.2098% + 3.4316 - ———____ 
2-7.69 
= 69.3147%/366.8941% 
= 18.8923%, 


1.7 - 46.3798% 


RORAC(-1.7 - Yo | s(@°))) = 


8.33 -2.7731 
—1.7 - 46.3798% + 3.4316 - ———_—_. 
2-7.69 
= 78.8457%/436.5613% 
= 18.0606%. 
(6.14) 


These results are providing us the following information: on the one hand, 
although it is expected that the current position on the second asset would 
generate a higher return (near 79%) than the current position on the first 
one (69%), the trade-off between risk and return measured by the portfolio- 
related RORAC is better for the position on the first asset than for the posi- 
tion on the second one. Additionally, the portfolio-related RORAC of the 
position on the first asset (18.8923%) is higher than the RORAC of the over- 
all portfolio (18.4404%) which, in turn, is higher than the portfolio-related 
RORAC of the position on the second asset (18.0606%). As we know that 
within this allocation the RORAC compatibility is satisfied, a natural strat- 
egy to improve the RORAC of our current portfolio would be slightly increas- 
ing our position in the first asset and shorting our position in the second 
asset (also with a slight decrement of the position). For instance, it can be 
checked that if positions (ul, us) are changed to be (1.56, 1.69) instead of 
(1.5, 1.7), then the RORAC of the overall portfolio becomes 18.4479% (an 
improvement is achieved). 

This is a straight application of the RORAC compatibility property of RO- 
RAC contributions as stated in Definition 6.4. Note that this is not contra- 
dictory with the fact that an optimal portfolio (in terms of RORAC) would 
be found following a different strategy (for instance, increasing current posi- 
tions in both assets at the same time). Why? Because the strategy of increas- 
ing both positions may be decomposed in several sub-strategies (steps) in 
which increasing or decreasing positions on the assets depend on the recal- 
culated portfolio-related RORACs. Going back to the numerical example, af- 
ter changing our portfolio to 1.56 units of the first asset and 1.69 units of the 
second one, it can be checked that RORAC(—1.56- Yj | s(i#°)) = 18.7133% 
and RORAC(-1.69- Y> | s(#°)) = 18.2104%. To improve the overall RORAC 
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of the portfolio we could now decide to change again our positions, increas- 
ing to 1.69 our position on the first asset and freezing the position on the 
second one. By doing this, we would obtain an overall RORAC for our new 
portfolio equal to 18.4521% (higher than the 18.4479% obtained with the 
first change of positions). But now RORAC(-1.69- Y; | 5(@)) = 18.4120% 
and RORAC(—1.69: Y> | 5(#°)) = 18.4923%, which would drive us to make 
another change: we would increase our position on the second asset to, for 
example, 1.71 units and let the position on the first one equal to 1.69. If we 
do that then the RORAC of the overall portfolio slightly raises to 18.4522%. 
Is this portfolio with (ul, us) = (1.69, 1.71) optimal in terms of RORAC? We 
do not know, but its overall RORAC (18.4522%) is higher than the original 
one (18.4404%) and, as we have shown, original positions on both assets 
have increased. 


This example has been adapted from Buch etal. [2011]. The interested reader 
can be found there examples showing the sub-optimality, in terms of the RO- 
RAC of the overall portfolio, of some strategies devised by properly applying 
the RORAC compatibility property that the contributions derived from the 
gradient allocation principle satisfy. 


Some final comments on the gradient allocation principle. This elegant ap- 
proach to proportional capital allocation principles based on partial contri- 
butions has two main drawbacks when they are applied on a real context. 
On the one hand, infinitesimal (or very small) perturbations on the risky 
position of an agent can often not be made in practice. Frequently, it is not 
feasible to perform arbitrarily small changes in positions. As a consequence, 
the compatibility of RORAC contributions should be barely satisfied even 
for the risk contributions linked to the gradient allocation principle. 


The second limitation is related to the computation of risk contributions 
(3 fz/ðui) (ii), where the value of the risk measure p for sums ṣ(ù), Ù € R” 
cannot be expressed in an analytic closed-form expression. This is, probably, 
the most frequent practical situation. In most of the cases, some decisions 
must be taken in order to approximately do an allocation based on the gra- 
dient. For instance, Tasche [2007] shows how the risk contributions of X; 
to the VaRq of the portfolio can be approximated using kernel estimators. 
With this respect, it has to be noted that this situation is extremely similar 
to the one depicted in Figure 1.2 in Chapter1, but now considering on the left 
hand side (Theory) a gradient allocation principle K and in the right hand 


side (Practice) the effective estimation of that capital allocation principle K. 
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6.3.2 Other capital allocation principles based on partial 
contributions 


There are other examples of proportional capital allocation principles based 
on partial contributions fitting expression (6.2). Two examples are given 
here, one from a probabilistic perspective and another one from a game- 
theoretic perspective. 


The covariance allocation principle 


This principle is proposed, for instance, in Overbeck [2000]. It takes into 
account the variance as the risk measure for the whole portfolio: p(S) = 
V(S). The partial contribution of the ith agent X; is the covariance of X; 
with respect to S, so p(X; | S) = Cov(X;,S). Therefore, this principle is 
expressed as 

Cov(Xi, S) 
VS) 


Note that p(S) = Li p(X; | S) because of the (bi)linearity of the covari- 
ance: 


Ki=K , Wi=l,..,n. (6.15) 


Cov(X;,5) = }) p(X; |S). 


p(S) = V(S) = Cov(S, S) = con Xj, s| = 
j=1 j=1 


n 
j=1 j= 
From the perspective of the Euler’s Theorem on homogeneous functions, 
this principle can be understood in two different (but related) ways. The 
first interpretation considers as the risk measure p the variance in expres- 
sion (6.4), in order to interpret the covariance principle similarly to a gradi- 
ent principle. The resulting function fz = Vos is not an homogeneous func- 
tion of degree r = 1 but an homogeneous function of degree r = 2, because 
the variance is not a positively homogeneous risk measure but satisfies the 
following relationship: for all A € R and for all X € T, V(A- X) = 4? - V(X). 
From Theorem 6.1 this means that expression 


ov (£r UK: Xx) 


n n 
2-V| >) uj-X;|= u (6.16) 
j=l i=l ĝui 


holds or, in other words, that 


1 V (Zk uk: Xx) 
2 Ou; 


= Cov(X;, S). 


Lew. iets 1) 
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Let us check this last equivalence: 


ð n ð n n 
Ju V 2 uy Xj =a Y 2 Uj: Xj, Dd) Uk Xk 
j=l l j=1 k=1 


i 


n 
Uj: oox; $ w xi} 


~ Ou; jal 
n n 
Cov(X;, X 
-2 La“ -ug Cov(Xj, Xy) 
ð 
~ Bu; Lu up :Cov(X;, Xx) 


+) > Uuj: Uz Cov(X;, Xx) 


j#ik=1 
ô 
u$ - Cov(X;, Xi) + È ui- up Cov(X;, Xy) 
~ Ou; k#i 
+ L L uj: up: Cov(X;, Xx) 


jžikži 


+) uj mesCones X) 

i#i 

=2-uj-Cov(Xj, Xi) +2: J ug: Cov(X;, Xx) 
k#i 

= 2-Cov(Xj,5()) 


If last expression is evaluated at % = (1,1,...,1) then the desired result is 
found. 

The second interpretation allows to understand the covariance allocation 
principle as a pure gradient allocation principle as explained in Section 6.3.1. 
The key is to consider as the risk measure p in (6.4) the covariance of a ran- 
dom variable with respect to the sum S of the components of X instead of 
the variance. So the function fẹ is taken as fy = Cov(-,S)os. As long as 
Cov(A - X, S) = A-Cov(X, S) for all A € R and for all X €T, fẹ is an homo- 
geneous function of degree r = 1 and Theorem 6.1 may be applied in this 
case as in Proposition 6.1. 


Example 6.3. Let us apply the covariance principle to obtain an allocation 
linked to the portfolio of Example 6.1. 
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The first step is to recall the expression for the variance of (s(i)) and to de- 
rive the expression for the homogeneous (covariance) function: 


V(s(ii)) = u? + Uy Un + US, 


Cov(- u1: Y1, 5(0)) = E [(~ u1 - Yı - E(- u1 - Yi)) - (6) - E@())) 


=E 


2 2 


1 3 
w1y-Xi-[-an m= Foti 9] 


_ 32 
= Uy + g U1: Ue, 


Cov(— up: Y2,5(0)) = E [(- u2: Yo — E(- u2: Y2)) - (S(ŭ) — E(6(W)))] 


1 V3 1 V3 
EF} |->: u2: X) — —- U2: Xo || -u Xy — =: U2: Xi- — +2: X: 
| z 2 ee ee | 1&7 M2 A y | 


= 5 Ma U+ U5. 


If (u?, u9) = (1.5, 1.7) then V (s(i%°)) = 7.69, Cov (—1.5 - Yj,5(@°)) = 3.525 
and Cov (—1.7 - Y2,5(i@°)) = 4.165. In relative terms, the covariance alloca- 
tion principle is assigning a 45.84% of the risk (measured by the variance) 
to the first asset and a 54.16% to the second one. 


Finally, some comments on strengths and weaknesses of the covariance prin- 
ciple may be pointed out. As a strength in front of other gradient allocation 
principles, estimators of both V(S) and Cov(X;,S) for all i = 1,...,m can 
be found satisfying that the sum of the estimated covariances add up to the 
estimated variance of the overall portfolio, whatever the set of random vari- 
ables {X;};=1 
drawback commented at the end of the previous section. As a weakness, the 
allocation only takes care of linear dependence structures between random 
variables X;, i = 1,..., n, and may lead to negative allocated capitals K;. The 
article of Wang [2014] is inspired by the covariance allocation principle and 


n is. Hence, the covariance principle overcomes the second 


josep 


the tail variance risk measure presented in Furman and Landsman [2006]. 
In this work the author define the capital allocation principles based on the 
Tail Covariance Premium Adjusted and tackles a possible non linear depen- 
dence between business lines. 


The Shapley value principle and one ofits simplifications 


Another proportional allocation principle based on partial contributions 
can be derived from game theory. The capital allocation problem can be 
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understood as a cooperative game in which capital K has to be fairly shared 
by the agents, taking into account that the cost of a coalition is linked to the 
risk that this coalition assumes. The key concept to find such a fair alloca- 
tion is the Shapley value (sometimes also called Bondareva-Shapley value). 
Let us use the following notations: N = {1,..., n}, A S N denotes a subset 
of N with cardinality a = | A| and (A) = p (x kea X. k). A capital allocation 
principle based on the Shapley value is of the form (6.2), where 


a!l- (n—-a-— l1)! 
X OPOE 


p(X; |S) = -[Z(AU {i} -R(A)]. (617) 


AcN~{i} n! 


Note that %(N) = p(S). Additionally, it can be proved that p (S) = "| p (Xi 
| S) using the properties of the Shapley value. The contribution of ith agent 
to the overall risk is, basically, a weighted average of all the marginal contri- 
butions that ith agent makes on the risk of each of the coalitions that can 
be obtained without ith agent. This principle can require a high computa- 
tional demand for obtaining each p(X; | S) if n is large. 

In order to avoid this drawback, some authors propose an alternative ap- 
proach, that is a simplification of this principle. In Balog [2010] this alter- 
native is called incremental principle. It is built by reducing the number of 
terms added up in expression (6.17) only to the one linked to the set N ~ {i}. 
In other words, the incremental principle is of the form (6.1) where 


fx =@80) BUN ~()= (9-0 EX) i Hlaicath 
j#i 


This alternative principle assigns as partial contribution of ith agent the dif- 
ference between the overall risk and the risk quantified in absence of the ith 
agent. To some extent, this principle can be considered as a hybrid between 
a proportional principle based on partial contributions and a stand-alone 
proportional principle. This principle cannot be considered a proportional 
principle based on partial contributions, because Xia fi(X;) # p(S). But, 
at the same time, some relationship between ith agent and the rest of partic- 
ipants is taken into account by fj, so it cannot be considered a stand-alone 
proportional principle. The loss of information is the price that must be paid 
to reduce the computational cost of the Shapley value for large n. 


Example 6.4. Let us illustrate the Shapley value principle. We will consider 
three random variables, two of which are identical. They can only take four 
possible values. So, for the following states (Ww ,W2,W3,W4), random variables 
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Xı, X2 and X; are defined as follows: 


w1 60 w1 w1 3 
w2 30 w2 W2 30 
X 7 at a =X = 
1 w3 0 $ W3 3 W3 —7.5 
w4 -15 w4 w4 15 


The corresponding probabilities are: 


w1 pı 1/10 
P W2 be! P2 ie 1/10 

W3 P3 2/5 

w4 p4 2/5 


This information can be displayed in a table, where S = Xj + X2 + X3 and 
the other partial sums can also be computed. 


Q P Xı Xə X3 Xı + Xi=23 Xo + X3 S 


wı 1/10 60 3 3 63 6 66 
w2 1/10 0 30 30 30 60 60 
w3 2/5 30 -7.5 -7.5 22.5 -15 15 
w4 2/5 -15 15 15 0 30 15 


The reader can easily compute the survival function for each of the random 
variables. Let us show how to compute the probability distribution function 
and the survival function for S, namely the sum of the three initial random 


variables. 
Q P S Fs Ss 
w3 2/5 15 4/5 1/5 
w4 2/5 15 4/5 1/5 
w2 1/10 60 9/10 1/10 
w1 1/10 66 1 0 


The question is now to illustrate how to perform the allocation of a risk 
measure based on the Shapley principle. Let us consider the TVaRg5% (S). 
Then, the following measures are needed: TVaRg5%(X1), TVaRg5%(X2), 
TVaRg5%(X3), TVaRgs5%(X1 + Xi=2,3), TVaRg5%(X2 + X3) and, obviously, 
TVaRg5% (S). 
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Recall (see, Chapter 1, eq. (1.4)) that an expression for the TVaRg (Z), for a 
random variable Z, is: 


TVaRgq(Z) = VaRg(Z) + — -ESq(Z). (6.18) 


Then since, VaRg5% (S) = 60 and 


4 1 6 
ESg5%(S) = )_ (sj — VaRgs%(S))+- pj = (66-60): — = —, (619) 
fa 10 10 
it follows that 
100 6 60 
TVaRg5% (S) = 60 + —— -— = 60+ — = 64. 6. 
85% (S) TET 15 (6.20) 


Similarly, TVaRg5% (X1) = 50 and TVaRgsy (X2) = TVaRg5% (X3) = 25, while 
TVaRg5%,(X1 + Xi=2,3) = 52 and TVaRg5%(X2 + X3) = 50. 

Since all the necessary values of the risk measure p = TVaRgsy are ready, 
then two tables are constructed in order to obtain p(X, | S), 0(X2 | S) and 
p(X3 | S). Let us first concentrate on X4. 


A EX a a SE X+ E Xy AA AAv i) 
keA 


keA 
Ø A] 0 1 2/6 Xı 0 50 
{2} X2 1 1 1/6 Xi + X2 25 52 
{3} X3 1 1 1/6 Xı + X3 25 52 
{2,3} X2+X3 2 2 1/6 S 50 64 


Using expression (6.17), it follows that: 


2 1 1 2 
PAT al= U +a G2 Sat g Oe ank z 04 50) 


1 
=30+- 
3 


Using a similar procedure, it follows that p(X2|S) = p(X3 | S) = 16+ 5/6. 
Therefore, p (S) = es p(X; | S) = 64. 
6.3.3 The excess based allocation principle 


The last principle explained in this chapter is the principle proposed in van 
Gulick et al. [2012]. The reason is twofold. On the one hand, because of its 
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originality and, on the other hand, because ofits non-proportionality. Taken 
the authors’ own words [cf. page 29]. 
“The allocation rule that we propose determines the allocation that lexicograph- 
ically minimizes the portfolio’s excesses among a set of allocations that satis- 
fies two basic properties. First, no portfolio is allocated more risk capital than 
the amount of risk capital that it would need to withhold if it were on its own. 
Second, a portfolio is not allocated less than the minimum loss it can incur”. 
To better understand this principle, the following definition from van Gulick 
et al. [2012] must be presented. 


Definition 6.5 (Lexicographical ordering). For m € N and any two vectors 
X, YER", X is lexicographically strictly smaller than y, denoted as X <jex J, 
if there exists an i < m such that x; < yi, and for all j < i it holds that x; = 
yj. Moreover, X is lexicographically smaller than y, denoted by X <jex J, if 
X= Vor X <ļex VJ. 


The authors considers that the capital K to be shared among the agents is, 
in fact, equal to p(S), where p should be a coherent risk measure. They use 
notation N = {1,..., n}. Once these preliminaries are established, the idea 
of the excess based allocation principle may be outlined in four steps: 


(i) Consider any capital allocation principle K such that X ‘= Kj = p(S) 
and such that the following boundary conditions are satisfied for all 
i € N: max{0,mingeo Xi(w)} < Ki < p(Xi). The set of all the prin- 
ciples satisfying these conditions is called the set of feasible principles, 
denoted as ¥. 


(ii) Compute, for each feasible principle, the vector of dimension 2” con- 
sisting in (K) = (E|(jea(Xj- Kp), 
nent for each subset A © N. This component is equal to the mathe- 


. So, there is a compo- 
ACN 


matical expectation of the random variable that represents the non- 
negative excess of capital that principle K assigns to coalition A. 


(iii) For each feasible principle K, the components of @(K) are ordered in 


a 


a decreasing manner. The ordered resulting vector in R” is denoted by 


6[é(K)]. 


(iv) The excess based allocation principle, denoted by Kgga, is the feasi- 


~~ 


ble principle which lexicographically minimizes the 0[@(K)]. In other 
words, Kgga is chosen among all feasible principles as the principle 
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associated to the first position in the set of ordered 9[@(K)], suppos- 
ing that this order is similar to the one provided by a librarian who has 
been increasingly ordering vectors 0 [ē(Ř)] alphabetically. 


Obviously, last comment on step (iv) is not formal. A more precise way to 
present the excess based allocation principle is by 


Krpa={Ke§| Ole] <iex [@(C)], VCes}, (6.21) 


taking into account that the set at the right-hand side of expression (6.21) is 
a single value set [as proved in van Gulick et al, 2012] and that, therefore, 
there is a simplification of notation when identifying a set consisting on a 
single element with that element. 

Although this perspective on the allocation procedure is very interesting, as 
in the case of the capital allocation principle based on the Shapley value, 
this principle can involve a significant computational cost when n is large. 
In addition, as long as the coherence property is required to the risk measure 
p to be used in the EBA principle, it has to be noted that VaR could not be 
used as p in that allocation procedure. 


Example 6.5. The same random variables that were defined in the previous 
example (6.4) are used here to illustrate the excess based allocation princi- 
ple, using TVaRg5%. 

The following steps are needed: 


a) Conditions to be fulfilled by each feasible principle. 
b) Calculation of vectors é(K). 

c) Ordering the components of @(K), to obtain 0[é(K)]. 
d) Finding the excess based allocation. 


a) A feasible principle K = (Kı, K2, K3) has to fulfill the following condi- 
tions: 


Condition1. K = Kı + Kə + K3. Since K = TVaRg5y%,(S) = 64, then Kı + 
Kə + K3 = 64. 


Condition 2. K2 = K3, because the excess based allocation principle satis- 
fies the symmetry property. 
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Condition 3. Condition max{0, minoco X;(w)} < K; < o(X;) in this case 
correspond to 0 < Kj < 50 and 0 < Kọ < 25 (similarly, 0 < K3 < 25). 
Since by Condition 1 Kz = Kz = (64 — Kj)/2 it follows that (64 — 
Kı)/2 = Ko < 25 then 14 < Kı. 


Moreover, no sub-portfolio should be allocated more capital than its 
own risk. Therefore, Kı + Ko < p(Kı + K2) = 52 and then, since Kp = 
(64 — Kı)/2, it follows that Kı + (64 — Kı)/2 < p(K, + K2) = 52 and 
so, Kı < 40. Then, the final conditions are: 


14< Kı <40 
_ 64-K, 


Ko = Ky 5 


b) Vector &(K) has eight components, corresponding respectively to Ø, {1}, 
{2}, {3}, {1, 2}, {1, 3}, {2, 3} and {1, 2, 3}. It can be written as follows: 


0 

(60—K,)/10 if Kı <30 
(36—K,)/2 if Kj <30 
(Kı —4)/20 if Kı <34 
(Kı — 28)/4 if Kı <34 
(Kı —4)/20 if Kı <34 
i (Kı — 28)/4 if Kı <34 

(62 — Kı)/20 

(62 — Kj)/20 


(Kı —4)/10 if Kı <34 
(Kı —28)/4 if Kı <34 
1/5 


ē(Ř) = 


Note that the vector can be parametrized in terms of Kı. 

c) Ordering depends on the values of Kı. Therefore, for instance if Kı = 
14, then the decreasing order is, in terms of the subsets of N, {1} > {1,2} = 
{1,3} > {2,3} > {2} = {3} > N > Ø. 

The ordering is different for each of the following intervals of Kı, namely 
[14,23 + 1/3], [23 + 1/3, 32], [32,33], [33,260/7] and [260/7,40]. Due to 
linearity, we only need to concentrate on the extremes of the previous inter- 
vals in order to find the candidates for the excess based allocation principle. 
d) To finalize the computation of the allocation principle, we need to choose 
the principle that minimizes lexicographically the 0[@(K)]. We will first 
choose the principle in each interval and then find the overall choice. So, 
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In the interval Kı € [14,23 + 1/3], the minimum is found for {1} and it is 
given by Kı = 23+1/3,so Ky14,23+1/3] = (2341/3, 20+1/3, 20+1/3), 
because Kz = (64 — Kj)/2. Moreover, @(Kj14,2341/3)) = (6.3, 1.93, 
1.93, 1.93, 0.96, 0.96, 0.2, 0). 


In the interval Kı € [23 + 1/3,32], the minimum is at {1} and it is given 
by Kı = 32, so Kj2341/3,32) = (32, 16, 16). Here 6(Kj23+1/3,32)) = (2.8, 
2.8, 1.5, 1.5, 1.4, 1.4, 0.2, 0), corresponding respectively to {1} > {2, 
3} >{1, 2}={1, 3} > {2} = 13> N> Ø. 


In the interval K; € [32, 33], the minimum is now at t2, 3} and it is given 
by Kı = 32, so again Kisz, 33] = (32, 16, 16) and ē(Kis2, 331) = (2.8, 2.8, 
1.5, 1.5, 1.4, 1.4, 0.2, 0), corresponding respectively to {2, 3} > {1} > 
1, 2} = {1, 3} > {2 = 13> N> Ø. 


In the interval K; € [33, 260/7], the minimum is at {2, 3} and it is given by 
Kı = 33, so again Kis3,260/7) = (33, 15.5, 15.5) and ē(Kis3,260/7)) = 
(2.9, 2.7, 1.45, 1.45, 1.45, 1.45, 0.2, 0), corresponding respectively 
to {2, 3} > {1} > {2} = {3} > {1, 2} = {11,3 > N > Ø. 


In the interval Kı € [260/7,40], the minimum is at {2, 3} and it is given by 
K; = 260/6, so again Kj260/7,40) = (37.14, 13.43, 13.43) and 
ē(Ki260/7,40]) = (4.57, 2.286, 2.286, 2.286, 1.2426, 1.2426, 0.2, 0), 
corresponding respectively to {2, 3} > {2} = {3} > {1} > {1, 2} = {1, 3} 
> N> Ø. 


In the previous allocations, we seek the lexicographical minimum, which 
corresponds to @(Kj23+1/3,32)) = ē(Řis2,331) = (2.8, 2.8, 1.5, 1.5, 1.4, 1.4, 
0.2, 0). As a result, the excess based allocation principle is: 


K = (Kı, Ko, K3) = (32, 16, 16). 


Note that in van Gulick et al. [2012], the authors recommend an optimiza- 
tion procedure, which has not been implemented in the previous example. 


6.4 Further reading 


There is a large number ofacademic works related to capital allocation prob- 
lems. An extensive literature can be found discussing solutions to capital 
allocation problems [see, among others. Denault, 2001; Kalkbrener, 2005; 
Tsanakas, 2009; Buch et al., 2011; van Gulick et al., 2012]. A number of re- 
cent studies focus on specific probability distributions of losses [Cossette 


AN OVERVIEW ON CAPITAL ALLOCATION PROBLEMS 111 


et al., 2012, 2013], risk dependence structures [Cai and Wei, 2014], asymp- 
totics of capital allocations based on commonly used risk measures [| Asimit 
et al., 2011] or modifications of the optimization function to overcome lim- 
itations of allocations based on minimizing the loss function [Xu and Hu, 
2012; Xu and Mao, 2013]. More precisely, You and Li [2014] analyze capi- 
tal allocation problems concerning mutually interdependent risks, mainly 
where they are tied through an Archimedean copula. Wang [2014] investi- 
gates the usefulness of the Tail Covariance Premium Adjusted principle in 
the case of two business lines with exponentially distributed losses, where 
their dependence structure corresponds to a Farlie-Gumbel-Morgenstern 
copula. Zaks and Tsanakas [2014] generalize the framework proposed in 
Dhaene et al. [2012b], allowing the inclusion of different hierarchical levels 
of preferences about risk in the final solution. In Urbina and Guillen [2014] 
several principles are examined to solve a capital allocation problem related 
to operational risk. This list of academic contributions on capital allocation 
problems is not exhaustive. In fact, this topic is object of ongoing research. 
Two recent contributions are Tsanakas and Millossovich [2016] and Li and 
You [2015]. 


6.5 Exercices 


1. Compare the allocation in percentage to each risk, obtained in the ex- 
amples corresponding to the Shapley allocation principle and the excess 
based allocation principle. 


2. Compute the RORAC for the risks described in Example 6.5. 


3. Compute the gradient allocation principle which was illustrated in Ex- 
ample 6.1, but now using a different level a = 95%. You should discuss 
the consequences on that principle of diminishing the confidence level. 


4. Consider again Example 6.1 and change the risk measure to another one, 
for instance TVaR. Compare the results with the ones obtained for the 
VaR. 


7 Capital allocation based on 
GlueVaR 


In Section 6.1 of the previous chapter a set of elements to fully describe a cap- 
ital allocation problem were identified. Nonetheless, two of those elements 
are of main importance: the assignment criterion and the functions used to 
simplify the information provided by each random loss. So, one could think 
that guidelines about how capital should be shared among firm’s units are 
basically defined in terms of two components: (1) a capital allocation crite- 
rion and (2) a risk measure. The choice of the specific form for each com- 
ponent is essential as different capital allocation solutions result from the 
specific selected combinations. 

In this chapter we consider the framework suggested by Dhaene et al. [2012b]. 
Under this framework, capital allocation principles are interpreted as solu- 
tions to optimization problems. This approach has been followed in the re- 
cent literature [see, for instance You and Li, 2014; Zaks and Tsanakas, 2014]. 


71 A capital allocation framework 


Most of the proportional allocation principles can be described in the frame- 
work suggested by Dhaene et al. [2012b]. Under this unifying framework 
a capital allocation problem is represented by means of three elements: a 
non-negative function (which is usually linked to a norm), a set of weights, 
and a set of auxiliary random variables. However, the Haircut allocation 
principle could not be fitted into this framework despite its simplicity: the 
Haircut allocation principle combines a stand-alone proportional capital al- 
location criterion with the classical Value at Risk. 

Here, the extension of the framework due to Dhaene et al. [2012b] is de- 
scribed. This was suggested in Belles-Sampera et al. [2014b]. A slight modi- 
fication of the original framework was proposed, consisting in relaxing some 
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of the conditions in order to allow the inclusion of the Haircut capital allo- 
cation principle. 

Assume that a capital K > 0 has to be allocated across n business units de- 
noted by i = 1,...,. Any capital allocation problem can be described as 
the optimization problem given by 


min, br efe o()| s.t. TEK, (7.1) 


K,,Ka,.. n 


with the following characterizing elements: 
(a) afunction D:R— R+; 


(b) a set of positive weights v;, i = 1,..., n, such that ))""_, vj = 1; and 


(c) a set of random variables ¢;, i= 1,..., n, with E[¢;] < +00. 


Unlike the original framework provided by Dhaene et al. [2012b], a distinc- 
tion is made in (c) so that each ¢; is now no longer forced to be positive with 
each E[¢;] equal to 1. 


To conclude, there exist a relationship between this capital allocation frame- 
work and aggregation functions. Aggregation functions may be defined as 
solutions to optimization problems, as proposed in De Baets [2013]. Capi- 
tal allocation problems are disaggregation problems and therefore, to some 
extent, the goal of capital allocation principles is the opposite of the goal of 
aggregation functions, which is a summarizing purpose. Nonetheless, the 
optimization perspective taken into account in expression (7.1) involves ag- 
gregation operators in the objective function. For instance, one can think 


n Xj -Kj 
ee 
j=1 Kij 
composition of two “ae aggregation operators: one aggregation operator 
X;—K; 

J J 


of the function E to be minimized in (7.1) as the 


is given by expression 3 vj: D and the other one is the math- 
j=1 
ematical expectation E. It has to be noted that a similar perspective is pro- 


posed in Xu and Hu [2012], where the first aggregation function may be rep- 
resented as Y(L(K)) = ey w(X; — K;)), where y is a function usually 
linked to a distance and Y an increasing function (which could be the iden- 
tity function, for instance). 
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7.2 The Haircut capital allocation principle 


Following the modification that we proposed in Belles-Sampera et al. [2014b], 
the Haircut capital allocation solution can be obtained from the minimiza- 
tion problem (7.1). If a capital K > 0 has to be allocated across n business 
units, the Haircut allocation principle states that the capital K; to be as- 
signed to each business unit must be 


Fz (a) 
k= ke Viz=1,...,n, (7.2) 
E Fola) 
ja * 


where X; is the random loss linked to the ith business unit, F * is the in- 
verse of the cumulative distribution function of X; and a@ € (0, 1) is a given 
confidence level. 

Let us consider d; = min {d > 1 | 0 <|M®@[X;]| < +oo} for all i = 1,..., n, 
where MÎ[X;] = E[X@] is the moment of order d > 0 of random variable 
Xi. Note that di > 1 for each i to face a feasible capital allocation problem. 


In other words, if a business unit presents a random loss with no finite mo- 
ments, then the risk taken by that business unit is not insurable/hedgeable. 
The approach for fitting the Haircut allocation principle in the framework 


linked to the optimization problem (7.1) can be summarized as follows: if a 
dj-1 


constant r; must be expressed as r; = E[¢;-X;], then using ¢; = -Tia 


Mí [Xi] 
solution is found because E[¢ ;- Xj] = E[(x@ Ma [X;])] -ri = rj. Although 


this is an elegant approach, the interpretation of the transformation made 


by ¢; on X; is intricate. 


Proposition 7.1. Let us consider a confidence level a € (0,1). Then, the three 
characterizing elements required to represent the Haircut allocation principle 
in the general framework defined by (7.1) are: 


(a) D(x) = x’, 
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Proof, In this setting, it is straightforward to show that the solution K = 
(Kı, Ko,..., Kn) to the minimization problem (7.1) is the Haircut allocation 
solution expressed by (7.2). Dhaene et al. [2012b] show that, if function D is 
the squared Euclidean norm (D(x) = x’), then any solution to (7.1) can be 
written as 


n 
Kitt xiso |K- Ù : i Xi) forall i=1,...,n. (7-3) 
j=l 


In this setting, vj = E[¢;- Xill Xia E[¢ j - Xj] for each i, so 


pai EG- X)=K- El¢i- Xi] l 
È Elg;- Xj] El¢ j: Xj] 


j=l 


Ki = Elči: Xil +K- 


1 


TMs 


And, finally, for all i it holds that E[¢;-X;] =F x (a) because of (c). There- 


fore, each K; in the solution K is given by 


Fy! (a) 

K; =K.— e 

Fw 
ja” 


O 


Some comments on v; weights and ¢; auxiliary random variables follow. 
Capital allocation principles driven by (7.3) can be thought of as two step 
allocation procedures: in a first step, a particular quantity (C; = E[¢; - X;]) 
is allocated to each business unit. As the sum of all these quantities does 
not necessarily equal K (i.e., i C; # K), in the second step the differ- 


ence (x — Xi C i] is allocated to the business units considering weights 
v;. From this perspective, C; capitals are expected values of X; losses re- 
stricted to particular events of interest and, therefore, ¢; auxiliary random 
variables are used to select those events of interest for each business unit. 
On the other hand, v; weights are related to the second step of the proce- 
dure, indicating how the difference between K and )° jel C; must be dis- 
tributed among business units. For a deeper interpretation of v; weights 
and ¢ ; auxiliary random variables in more general cases, the interested reader 
is referred to Dhaene et al. [2012b]. 

A remark on the gradient allocation principle. This principle can be fitted 
into the framework introduced by Dhaene et al. [2012b] following a similar 
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ô 
strategy than the one in Proposition 7.1, but changing F l(a) b 2P (S) for 
gy P ging iy, y ðu; 
l 


all į = 1,..., n. Or, in other words, 


di—1 
i: Xi X;" ô 
D=, n= ÉE% and u-( sig} Zo 


no ditx.)| ð 
¥ Ell; Xj Sr ee 
j=l 

forall i = 1,..., n. Therefore, we find again that the gradient allocation prin- 


ciple is a proportional principle based on partial contributions, although we 
have now used a side track to arrive to this conclusion. 


7-3 Proportional risk capital allocation principles 
using GlueVaR 


The three characteristic elements of the framework suggested by Dhaene 
et al. [2012b] are function D, weights v; and a set of appropriate ¢;, for 
all i = 1,...,m. According to the notation used by Dhaene et al. [2012b], 
we deal with business unit driven proportional allocation principles when 
¢; depends on X;. If č; depends on S = 2o Xi then we have aggregate 
portfolio driven proportional allocation principles. In the former case, the 
marginal risk contributions of business units to the overall risk of the port- 
folio are not taken into account; in the latter, they are. Adopting the no- 
tation introduced in the previous chapter, principles belonging to the first 
category are here denoted as stand-alone proportional allocation principles 
while principles in the second category are denoted as proportional alloca- 
tion principles based on partial contributions. 

In this chapter, two GlueVaR based proportional capital allocation princi- 
ples that we suggested in Belles-Sampera et al. [2014b] are presented. Both 
principles share the expressions for two of the three characterizing elements: 


El; X; 
D(x) = x? and vi = ee al i=l,...,n. 
£ E[o;- Xj] 
j=l 
They differ in the set of random variables ¢;, i = 1,...,, which are pre- 


sented below for the case of continuous random variables X;. 
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7-3-1 Stand-alone proportional allocation principles using 
GlueVaR 


Given two confidence levels a and £ in (0, 1), a < p, and two distorted sur- 
vival probabilities hı and ho, if Č; is fixed as 


1 [x> FG) 1 |X; > Fy @ 
= i eee et 
(j=) 1-6 w2 E 
di—1 
+w3-—!—_- F; (a), forall i=1,...,n, 
3 Mix * (7.4) 


then the stand- eon proportional allocation principle using as risk mea- 
sure the GlueVaR”’” can be represented in the modified capital allocation 
framework explained i in Section 7.1. Components of the solution (Kj, Ko,.. 
Kn) are expressed as 


GlueVaR i!" (Xi) 
Kij=K-— i , forall i=1,...,n. (7.5) 
X GlueVaR 5!" (X;) 


7-3-2 Proportional allocation principles based on partial 
contributions using GlueVaR 


Simay if there exists a confidence level a* € (0,1) such that F z (a) = 


z 1 Fy (a*), the proportional allocation principle based on paral con- 


tributions using GlueVaR mha can be fitted to the modified capital allocation 
framework detailed in Sechan 7.1. In this case, ¢; has to be equal to 


= 1[S>F51(p)] 1[S>F51(a)] 
Ora ae 
= 
+W3° PLIES “Fx (a*), forall i=1,. (7.6) 


Each component of the solution (Kj, K2,..., Kn) is then obtained as 


E[X;|S>F51(p)| ; E[X; | S> Fz™(a)] 
w2: 
GlueVaR g! "i (S) GlueVaR g (S) 


+w: — ~]. (7-7) 
GlueVaR ji” (S) 


Ki=K.- |w: 
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Alternatively, another approach can be considered. There exists a set of con- 
fidence levels a; € (0, 1), forall j = 1,...,n, such that Fz (a) = oA Fy, (aj). 
Therefore, the proportional allocation principle based on partial contribu- 
tions using GlueVaR gi”? can also be fitted to the modified capital allocation 
framework. In this case, ¢; have to be equal to 


1[S> F;'(p)] : 1 [S > Fz (a)] 


w 
1-6 i l-a 


dj-1 


Gi =@1- 


+03 -Fz (ai), forall i=1,...,n. (7.8) 


i 
M4@[Xj] 


Each component of the solution (Kj, K2,..., Kn) is then obtained as 


E[X;|S>F51(p)] E[X;|S2F5"(a)] 
: +w 


Kiı=K: |ø hh; 2° hh 
GlueVaR Ba 2 (S) GlueVaR Bi 2 (S) 
Fx" (ai) 
+ W3*— 7 |. (7-9) 
GlueVaR Ba 2 (S) 


A final comment related to non-proportional capital allocation principles 
using GlueVaR. It has to be mentioned that it is possible and straightfor- 
ward to obtain non-proportional principles using any of the auxiliary ran- 
dom variables ¢; described in expressions (7.4), (7.6) or (7.8). If function 
D(x) = x”, then the only thing that must be taken into account is that at 
least one of the weights v;, i = 1,..., n, must be different from E[¢; - X;]/ 


( z 1 Elg ;-X;]). Under these restrictions, whatever set of auxiliary random 

variables ¢ is chosen among expressions (7.4), (7.6) or (7.8), non-proportional 
capital allocation principles K using GlueVaR are obtained through expres- 

sion (7.3). 


74 An example of insurance risk capital allocation 
using GlueVaR on claim costs 


Data of previous chapters are used to illustrate the application of capital 
allocation principles based on GlueVaR risk measures. Table 7.1 shows risk 
values for this example. The last column presents diversification benefit, 
which is the difference between the sum of the risks of X1, X2 and X3 and 
the risk of X + X2 + X3. In this example, VaRg5% and one of the GlueVaR 
risk measures are not subadditive in the whole domain. 
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Table 7.1 Risk assessment of claim costs using GlueVaR risk measures 


Xı Xə Xz X,+X2+X3 Difference? 
(a) (b) (c) (d) (a)+(b)+(d)-(c) 


VaRo5% 25 06 11 59 -1.7 
TVaRgs% 12.5 8.0 13 19.7 2.1 
TVaRg9.5% 40.8 42.1 1.8 81.1 3.6 
GlueVaRgo szosy 18.6 16.9 1.4 35.6 1.3 
GlueVaRgo sy osy 94 42 1.2 12.9 1.9 
GlueVaRgo ss osm 49 2.9 11 10.2 -1.3 


©) Benefit of diversification. 


Next, a capital allocation application is illustrated where total capital has to 
be allocated between the three units of risk, X1, X2 and X3. Table 7.2 shows 
particular allocation solutions for two proportional risk capital allocation 
principles using GlueVaR. 

A different pattern is observed for the three GlueVaR risk measures when the 
stand-alone criterion or the partial contribution criterion is considered. In 
the case of the stand-alone criterion, the capital is allocated primarily to risk 
Xı, followed by Xz and X3, respectively. Let us focus on capital allocation 
solutions involving the partial contribution criterion in which confidence 
levels aj, j = 1, 2, 3, are not forced to be equal across the risk units. A no- 
table increase in the risk allocated to X2 is observed if a partial contribution 
criterion with no constant level q j and GlueVaR jg 4s. ae is chosen’. 

This result is obtained because the impact on the quantile of X2 is the op- 
posite of that on Xı and X3 when aj, j = 1, 2,3, are estimated as Be 
(95%) = Fzt (a) + Fy) (a2) + F} (a3). These confidence levels are equal 
to A, = 26%, a@2 = 98% and a3 = 43%. So, the associated quantiles for in- 
dividual variables are VaR26% (X1), VaRogy (X2) and VaR43y%(X3). The risk 
contribution of X; and Xz are underweighted compared to the risk contri- 
bution of X2. If we interpret the GlueVaR risk measure as a linear combina- 


1 The partial contribution criterion with constant level is not calculated in this example. 
However, there is a @* = 95,42% such that VaRg5% (Z) = Da VaRg5,42% (Xj). 
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Table 7.2 Proportional capital allocation solutions using GlueVaR for the claim 


costs data 


Proportion Proportion Proportion 
allocated to allocatedto allocated to 


Xı X2 X3 
Stand-alone criterion 

11/30,2/3 

GlueVaRjy Soas 50.41% 45.80% 3.79% 
0,1 

GlueVaR9o 5, 95% 63.51% 28.38% 8.11% 
1/20,1/8 

GlueVaR gq 5% 95% 54.44% 32.22% 12.22% 


Partial contribution criterion with non constant ™ a j 


Glue VaR ae oe P 46.42% 51.74% 1.84% 
GlueVaRS o ou, osu ® 68.19% 26.86% 4.95% 
Glue VaR go ony P 25.11% 73.11% 2.78% 


‘® Confidence levels aj € (0,1) are selected to satisfy Fz! (95%) = Fy) (a1) + 
Fy} (a2) + Fy} (œs). In this case @1 = 26%, a2 = 98% and a3 = 43%. 


tion of w1 -TVaRgg.5% + w2 : TVaRg5% + w3 - VaRg5%, the associated weights 


of the GlueVaR}y 07 Geo, are w1 = 1/24, w2 = 1/12 and w3 = 21/24. So, the 


GlueVaR} 20%] sa reflects a risk measurement attitude just a bit more con- 
070, a] 


servative than VaRg5%, giving the largest weight to this risk value. Bearing 
in mind the quantitative tools that we have proposed in Chapter 5 to assess 
aggregate risk attitudes, the latter statement is reinforced by the following 


fact: the area under the distortion function of GlueVaR4 (20? Ra is 


1 14+99.5% 1 1+95% 21 
. + . + -95% = 0.042 + 0.081 + 0.831 
24 2 12 2 24 


= 95.4% 


which is, effectively, slightly higher than 95%, the size of the area associated 
to VaRgs%. 
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7.5 Exercices 


1. Consider two risks that are Normally distributed with means, u; and u2, 
both non-negative and covariance matrix S. Write a program to imple- 
ment the GlueVaR stand-alone allocation principle assuming that the 
inputs are the distribution parameters and, in addition, «œ, p, w1, @2 and 
w3. 


2. Assume n random variables, each one is uniformly distributed in the in- 
terval [0,100]. Consider the GlueVaR stand-alone allocation principle, 
where w1 = w2 = w3, express the result of the allocation to each compo- 
nent in terms of n, a and $. 


3. Assume n random variables, each one is uniformly distributed in the in- 
terval [0,100]. Consider the GlueVaR stand-alone allocation principle, 
where q = 0.95 and $ = 0.99, express the result of the allocation to each 
component in terms of n, w1, w2 and w3 and discuss the particular cases 
when w1 = l or W2 = 1. 


4. Consider the example described in Section 7.4 and consider a change of 
monetary units, which means that each variable is multiplied by a con- 
stant E > 0, where E is the exchange rate. How would that modification 
affect the capital allocation results if nothing else changes? 


8 Capital allocation principles as 
compositional data 


In Chapter 6 it was shown that given an (absolute) capital allocation princi- 
ple K with K = Li Kj, its relative counterpart is defined as X, where com- 
ponents are x; = K;/K. This chapter is devoted to show that relative capital 
allocation principles can be understood as belonging to the (standard) sim- 
plex. Following a nomenclature often used by geologists, any vector of the 
simplex is called a composition and any set of vectors in the simplex is called 
compositional data. This chapter first presents the metric space structure of 
the simplex. Secondly, it is shown that it is possible to move forward and 
backwards from relative capital allocation principles to compositions and 
the opposite. Applications of this relationship are illustrated with the data 
set used all along this book. This chapter is based on the study that we car- 
ried out in Belles-Sampera et al. [2016a]. 


8.1 The simplex and its vectorial and metric structure 


Let us define the (standard) simplex Z” = {Z € R” | zj > 0, j =1,...,n, 

n 
Bi 
Any vector Z € P” is a composition and a set of vectors is called composi- 


Zj = 1} provided witha particular structure of vector and metric space. 


tional data. We need to define the vector space and to enrich it later with 
a distance in order to be allowed to talk about this vector space as a metric 
space. Any set of vectors needs two operations (often called vector addi- 
tion and scalar multiplication) in order to be called a vector space over R. 
These operations must satisfy a set of particular properties. The vector ad- 
dition must be commutative, associative, and a neutral element is needed. 
Moreover, for each vector, its additive inverse must exist. A scalar multipli- 
cation for a vector space over R combines a real number with a vector and, 
whatever this combination is, it is necessary that the combination must be- 
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long again to the set of vectors. Additionally, a neutral element for the scalar 
multiplication must exist, and the distributivity of the scalar multiplication 
with respect to the vector addition and, on the other side, the distributivity 
of the vector addition with respect to the scalar multiplication must be both 
satisfied. 

Following the notation provided in Aitchison and Egozcue [2005], a vec- 
tor addition called perturbation (denoted by ®) and a scalar multiplication 
called powering (denoted by ©) may be attached to the set Z”. These oper- 
ations are defined by expressions (8.1) and (8.2), respectively, where X, y € 
F” andAecR: 


TRS xı: yı Xn’ y 
soye p], (8.1) 
j=1%j Y] ja“ Yi 
A A 
> xy Xn 
ros=| ae (8.2) 
July j=1> j 


It can be proved that the simplex .Y” provided with operations ® and © has 
a linear vector space structure of dimension n — 1. An important function 
in the context of compositional data is the closure function, 6. The closure 
function applied to a vector in R” returns another vector whose compo- 
nents are the components of the original vector divided by the sum of all 
the components of the original vector. Keeping this in mind, the following 
expressions hold: 


XOV=E [x1 yn. Xn Vn)| , Aot=6|(x},...,xA)]. 


Moreover, assuming the vector space structure of (/”, ®, ©), the neutral 
element 0 of ® can be deduced. Given a vector X such that x; > 0 for all i, 
the relationship ¥ @ 7 = 0 informs that F is the inverse of X with respect to 
the perturbation operation, so it should be written as F = (—1) © X. In other 
words, 

1/x 1 1/Xp 


f= —— z...) = 
| j=l! x9) LET] 
Then, using this last expression and (8.1), 
> 1 1 1 1 
j=%er=% oe =(5,..2), 
(ate Sr n n 


so the neutral element 0 of the perturbation operation is the composition 
with all of its n elements equal to 1/n. 
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Figure 8.1 Example of perturbation (addition) and powering (scalar 
multiplication) in P? 


P? 


Example 8.1 (Perturbation, powering and neutral element in 7”). We pro- 
vide an example in a low dimension (7n = 2) in order to illustrate how the 
vector spaces (Y”, ®, ©) work. Consider X = (1/3,2/3) and y = (3/4, 1/4) 
in SF”, and A = 1/2 € R. We can ask ourselves for X ® y, AO xX and 0 


ees F 5 1 z] È =| 
XOy= ) = , , 
4 12 6 12 55 


v1/3 V2/3 
A © ž = | ——— ., ——_ | = (0.4142, 0.5858) , and 
V1/3+ V2/3 V1/34+ 72/3 
= E s] 
O=|-,-]. 
262 
= 1111 1 2 _ 
For instance, 260=[ ; js{ : a 
62 32 3 3 


All these vectors are displayed in Figure 8.1. 


Finally, a distance is needed in order to consider the vector space (F ”, ®, ©) 
as a metric space. The simplicial metric defined in Aitchison [1983] is here 
considered. Given two compositions X, y, the distance between them from 
the point of view of the simplicial metric is 


i yo yi 2 1/2 
SMS È il aa) l | pe) 
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where GM(Z) denotes the geometric mean of the components of Z vector, 
n l/n 

I] zi 

i=1 


An equivalent expression for A(X, y) is the following: 


this is GM(Z) = 


(8.4) 


This simplicial metric is linked to a norm ||- ||, and to an inner product (, )a 
in a usual way. Given two vectors X, ye 2”, 


A(X, Y) = 1X © Vila = y (XOY,XOY)g, 


where že Y= že [(-1) © y], and 


(8.5) 


Example 8.2 (Level curves in ?). Once the distances are defined, we can 
explore — for instance — the geometrical locus of all those elements in the 
simplex with the same distance to a given element in that simplex. In other 
words, we could be interested in determining a sort of level curves in S” 
related to the distances of compositions x € SY” to a fixed composition yo. 
Each one of these level curves would be driven by a certain d € R*, the target 
distance. Formally, we could look for geometrical loci denoted Icg(Vo) and 
defined by 


lca (Vo) = {XE OS | A(x, Yo) = d}. 


In Figure 8.2 several level curves are represented in a two dimensional pro- 
jection of S*. On the left, Icq(0) for d = 0.2, 0.45, 0.8 and 1.0 are repre- 
sented. As it is observed, the higher the distance to the composition of ref- 
erence (in this case, Vo = 6 = (1/3, 1/3, 1/3)) the sharper the curve. Differ- 
ences with respect to the Euclidean distance are evident, because these level 
curves have not circular shapes with center in the composition of reference. 
The behavior is similar on the right hand side of Figure 8.2. In that case, the 
composition of reference is Yo = (1/8, 1/2,3/8) instead of the neutral ele- 
ment with respect to the perturbation (addition). The corresponding level 
curves lcg(yo) for d = 0.2, 0.45, 0.8 and 1.0 are once again represented but 
with this alternative reference. 


CAPITAL ALLOCATION PRINCIPLES AS COMPOSITIONAL DATA 127 


Figure 8.2. Examples of level curves lcq (Yo) in FP? are displayed. In fact, they are 
projections in R? of the geometrical loci lcg (Vo) of the elements which distances to 


point jo are equal to d, where d = 0.2, d = 0.45, d = 0.8 and d = 1.0. 
Left figure: Yo = (1/3, 1/3, 1/3) (neutral element with respect to the perturbation). 
Right figure: Yo = (1/8, 1/2, 3/8). 


Under this framework, as it was shown in De Baets [2013], the simplicial 
arithmetic mean of the compositional data X1, X2,..., Xm may be understood 
as a solution of a minimization problem, in the following way: 
1 m m 
AMa is.. Ëm) = — © D ik= agile EAN (8.6) 


m 
where © Xx means the perturbation ofthe set of m compositions {Xx} k=1,...,m- 
k=1 
At first sight, this expression is equivalent to the arithmetic mean of m real 


numbers u1, U2,..., Um: 


m m 
AM(it1,...)4m) = —- Ý ug = argmin Y lv —ugll3, (8.7) 
M REI v k=l 

so, the simplicial metric presented in this section is the natural metric choice 
if (simplicial) arithmetic means are computed. In other words, the expres- 
sion (8.6) contains the proper definition of the arithmetic mean of X1,...,Xm 
in the metric space (Y”,®, 0, A). From the definitions of both perturba- 
tion and powering operations, an explicit expression for the simplicial arith- 
metic mean presented in (8.6) is 


AMa (%1,...,Xm) = @[(G,...,Gn)], (8.8) 


where Gk = GM(X},k, X2,k» -< -» Xm,k), Le. Gk = [E xi”, for all k = 
hst: 
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8.1.1 From capital allocation principles to compositional data 


An absolute capital allocation K has its relative counterpart X computed as 
xi = K;/K forall i = 1,..., n. Note that it is satisfied that Li xj = 1. Note 
also that when negative allocated capital amounts K; are allowed, the rel- 
ative components would be negative and then x ¢ 2”. For the rest of the 
chapter it is assumed that X has strictly positive components. That is, we as- 
sume that X is a composition with non-zero and non-negative components. 
This assumption allows to avoid negative or zero values on components of 
X, which are an inconvenient for practitioners (negative allocations) and 
when operating in the simplex (null compositions)’. 

At this point, some concepts introduced in Chapter 6 to classify absolute 
capital allocation problems can be associated to concepts introduced in this 
chapter. For instance, if we consider proportional capital allocation princi- 
ples as stated in expression (6.1), the relative counterpart Y of the absolute 
principle K = (K1, ..., Kn) may be interpreted as the closure of the vector 
with components equal to fi (Xi), i= 1,..., n: 


K st. Kj = K-———— re 
L fj (Xj) (8.9) 


When stand-alone proportional principles are considered, the previous ex- 
pression helps to visualize why dependence structures between random vari- 
ables {X;};=1,...,.n are not taken at all into account in the capital allocation 
solution. In a first step, the amount of risk faced by each agent is assigned 
to one of them, which is summarized by f;(X;), i = 1,..., n. Subsequently, 
the relative risk proportion obtained in that manner is scaled by K to obtain 
the final capital allocation. 


8.2 Perturbation inverse, simplicial distance and 
simplicial arithmetic mean applied to capital 
allocation problems 


In the previous section, it has been shown that relative capital allocation 
principles and compositions may be naturally linked. Once this relationship 


1 Although elements ofa composition can be equal to zero, dealing with compositions with 
null components is not an easy task in practice. 
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is established, the idea is to take advantage of the geometric structure of the 
simplex to enrich the description of each capital allocation principle and 
each capital allocation result. Some applications of compositional methods 
in the context of capital allocation problems are shown in this section. 


8.2.1 The inverse of a capital allocation 


Let us consider a relative capital allocation principle X linked to what we 
have called in Chapter 6 a cost of risk goal. A manager would want to depart 
from this allocation to distribute rewards instead of costs in order to fulfill 
an allocation with a reward to an objective linked to minimisation of risk. 
An intuitive idea is to invert each of the relative components, in order to 
reflect the inverse nature of the allocation (a relative low cost allocated to 
ith agent should mean a relative high reward assigned to him). To proceed 
in this direction, one must normalize the sum of all 1/ x; in order to provide 
a full allocation of the whole amount of capital, K. Note that the inversion of 
the components is only feasible if all components of X are different from 0. 
This application has a natural interpretation in the simplex Z”. The nor- 
malization can be understood as the application of the closure function. 
Given a relative capital allocation principle X, let 7 be the closure of the 
vector with components 1/x; for i = 1,..., n. As it has been shown in Sec- 
tion 8.1, F is the inverse of X with respect to the perturbation operation: 
r=(-1)0xX. 
Using risk based capital allocation principles to determine penalizations or 
rewards may lead to undesirable behaviors of the agents. Basically, agents 
have incentives to take conservative business decisions because less risk re- 
sults in a better reward. In order to prevent it, a return-on-risk measure 
seems to be preferable to assign rewards. It has been indicated in Chap- 
ter 6 that rewards on risk and return allocations may be of great relevance 
for a sound ERM system. 
Note now that there are some direct absolute reward on risk and return cap- 
ital allocation principles that can be considered. For instance, if we depart 
from a given X = © [7], where yj = RORAC(X; | S)/RORAC(S), for all 
t= lyss Iik 
Then, we obtain the absolute capital allocation principle K by 

RORAC(X; | S) 


K,j;=K-xj=K- , Vi=l,...,n. 
i ' L”, RORAC(X; | S) 


The underlying idea is to give a higher reward to those agents whose rel- 
ative RORAC with respect to the overall RORAC of the portfolio is higher. 
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Note that different definitions of return-on-risk measures than expressions 
(6.3) and (6.2) in Chapter 6 for RORAC(X; | S) and RORAC(S) may be con- 
sidered, and the objective of the allocation would not change. 


8.2.2 Ranking capital allocation principles 


We have presented a simplicial metric or distance A which helps to con- 
stitute SY” as a metric space. A can be used to quantitatively rank capi- 
tal allocation principles. Let us consider the neutral composition 0 € A” 
which is the composition with all of its n components equal to 1/7. So, the 
distance between any relative capital allocation principle ¥ and 6 can be 
computed. Alternatively, the distance between any pair of relative capital 
allocation principles belonging to Y” can be calculated. Both uses of the 
simplicial distance are useful to compare different capital allocation princi- 
ples in a quantitative manner. 

When the distance between the relative capital allocation % and 0 is com- 
puted, a quantitative result shows how far the allocation principle is from 
a neutral assignment. Note that 0 € 2” is linked to a capital allocation 
principle in which no matter how much risk each agent faces, they would 
all receive the same since the same amount is allocated to each one (K/n). 
On the other hand, if an allocation principle is taken as a reference (for in- 
stance, a gradient allocation principle as explained in Section 6.3.1 of Chap- 
ter 6), the distance between the composition linked to this principle and any 
other composition quantifies how far this principle is from the allocation of 
reference. 

Imagine that four allocation principles are in hand for the same amount K 
of money and the same n agents: a haircut allocation principle (7.2), Kn; a 
covariance allocation principle (6.15), Ke; a stand-alone proportional allo- 
cation principle based on GlueVaR (7.5), Kyanda gradient allocation prin- 
ciple related to (6.5), Kg. If their respective relative allocation principles 
Ýn Ýc, Xs and Xg are in /” and each ofthe components of X;, t € {h, c, s, 8} 
is strictly positive, then it is possible to rank them in two different ways: 


1) Compute A(X;, 0) for t € {h,c,s, g}. Order distances in an increasing or- 
der. A higher order position indicates an allocation located further apart 
from the neutral allocation; 


2) Choose one of the principles as reference (for instance, the gradient allo- 
cation principle). Compute A (z, Xg) for t € {h, c, s}. These three values 
are quantifying how far each principle is from the allocation of reference. 
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8.2.3 Averaging capital allocation principles 


In practice, different management teams may suggest different capital allo- 
cations regarding the same assignment problem. The situation sketched at 
the end of the previous section could be an example of such a situation. In 
those cases, we want to stress that the set of different capital allocation prin- 
ciples can be aggregated through the simplicial arithmetic mean, obtaining 
a final allocation that considers each one of the available viewpoints. 
Formally, let us imagine m management teams providing m absolute capital 
allocation principles Kj. of amount K to the same n agents, and let X; be the 
relative capital allocation principles linked to Ri, k=1,...,m. Once again, 
taking advantage of the geometric structure of S”, the concept of averaging 
the m points of view on the same allocation problem is easily derived. In 
other words, the expression Z = AM4 (X1,...,Xm)is the proper definition 
of the arithmetic mean of X1,...,X in the metric space (.Y”,®, 0, A) as 
it was shown in (8.6). Once the relative arithmetic mean is obtained, what 
remains to do is to assign an amount of Kj =K-z; monetary units to each 
ith agent, i = 1,...,n, in order to provide a capital allocation principle in 
the adequate scale. This principle balances the opinions of all the involved 
management teams. 


8.2.4 An illustration 


In order to illustrate the applications described in this section we are get- 
ting back to the relative principles obtained in Chapter 7, which where dis- 
played in Table 7.2. Recall that these relative principles were derived from 
six absolute proportional allocation principles (6.1) based on three differ- 
ent GlueVaR risk measures but with two different perspectives: on the one 
hand, stand-alone proportional allocation principles (7.5) and, on the other 
hand, partial contributions based proportional allocation principles (7.9). 
Let us name them as X;, i = 1,...,6. Then 


Xı = (50.41%, 45.80%, 3.79%), 
X2 = (63.51%, 28.38%, 8.11%), 
X3 = (54.44%, 32.22%, 12.22%), 
X4 = (46.42%, 51.74%, 1.84%), 
X5 = (68.19%, 26.86%, 4.95%), 
Xe = (25.11%, 73.11%, 1.78%). 


(8.10) 
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Table 8.1 Perturbation inverse relative capital allocation principles. 


Xı X2 X3 

Stand-alone 
fi =(-NoOx, 6.50% 7.15% 86.35% 
jo =(-DOX 9.03% 20.22% 70.75% 


Ys = (-1 © X3 14.00% 23.65% 62.35% 


Based on partial contributions 


Ya = (-1) OX, 3.68% 3.30% 93.02% 
Ys = (-1) © %5 5.78% 14.67% 79.56% 
Ys = (-1) © X6 6.48% 2.22% 91.30% 


From x} to x3 the results correspond to stand-alone proportional allocation 
principles and the rest may be understood as proportional allocation prin- 
ciples base on partial contributions. In addition, x; and x4 are linked to 


11/30,2/3 > > 0,1 n 
GlueVaR gq 5o% “95% X2 and X5 to GlueVaRg9 5% 95% and the remaining were 
Ri/20,1/8 


calculated based on GlueVaRgg 5% 959: 
ples belong to the simplex Z? and have non-zero components. 


Note that all these relative princi- 


Assume now that risk managers are interested in allocation principles with 
a reward to conservative objctives (meaning the smaller the risk figure the 
better the type of risk). However, the only available information (principles 
Xi, i =1,...,6) isa set of capital allocation principles with a cost of risk goal. 
The computation of the inverse of relative capital allocation principles can 
be useful in this context, in order to obtain principles driven by a reward 
to a risk minimization objective. So the perturbation inverses of relative 
capital allocation principles x;, i = 1,...,6, are shown in Table 8.1 and they 
are denoted as yj, i=1,...,6. 


Using the relative principles displayed in (8.10), relative allocation princi- 
ples are ranked according to Section 8.2.2. As before, symbol 0 is used to 
refer to the neutral allocation. The following simplicial distances are calcu- 
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lated from expression (8.3): 


A(X, 0) = V0.8 + 0.637 + 2.865 = 2.074, 
A(X2,0) = V0.911 + 0.022 + 1.218 = 1.4669, 
A(x3,0) = V0.453 + 0.022 + 0.674 = 1.0719, 
A( 0) = V1.083 + 1.32 + 4.795 = 2.6831, 
A(X, 0) = V/1.404 + 0.064 + 2.068 = 1.8803, 
A(X, 0) = V0.276 + 2.542 + 4.495 = 2.7045. 


(8.11) 


Figure 8.3 Distances between capital allocation principles. 


aah eee 


3.0740. 
ae x6 
xo X3 x2 X5 | | x1 
P0000 LOI 2a 
Eo 1.8803 


Distances considered individually are not too informative. However, these 
values allow to rank the principles with respect to one principle of reference, 
as it is graphically shown in Figure 8.3. From results (8.11), it can be deduced 
that, in this example, proportional allocation principles based on partial 
contributions are generally more distant from the neutral allocation than 
the rest (with x) being the only exception). Additionally, when comparing 
pairs of compositions with the same risk measure involved, it becomes evi- 
dent that the most different behavior is the one linked to principles depend- 


ing on GlueVaR}/7®1/8 risk measure (the pair £3 and ¥). So, in this exam- 


99.5%,95% 

; 1/20,1/8 
ple, when using GlueVaR}y 5% i 
location criterion (stand-alone versus based on partial contributions) seems 


y asa risk measure, the selection of the al- 


to be more relevant than when using the other two GlueVaR risk measures 
under consideration. 

As a final application, the three relative stand-alone allocations displayed 
in (8.10) are averaged by means of the simplicial arithmetic mean. The rel- 
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ative principles based on partial contributions are also averaged. Finally, 
the simplicial arithmetic mean of these previous averages is obtained, just 
for the sake of mixing both perspectives in one single principle following a 
hierarchical approach in which the distribution criterion (as mentioned in 
Section 6.1 of Chapter 6) plays an important role. 

With respect to the stand-alone proportional allocation principles, the geo- 
metric means of the three components (n = 3) of the three (m = 3) relative 
capital allocations, denoted as G1, G2 and G3, respectively, are computed. 
Their values are 


G} = (50.41% - 63.51% -54.44)!/3 = 55.86%, 
G3 = (45.8% - 28.38% - 32.22)! = 34.73% and 
G3 = (3.79% -8.11% - 12.22)!/3 = 7.22%. 


Following expression (8.8), we calculate the value of @ [(G1, G2, G3)], i.e. the 
closure of the vector with components being the geometric means G1, G2 
and G3. By doing so, the value of the simplicial average AMa (<1, X2, X3) is 
obtained which is a relative allocation principle. Similarly, the simplicial 
arithmetic mean of the relative principles based on partial contributions 
AMa (X4, X5, Xe) is also obtained. Both results are shown in Table 8.2. 


Table 8.2 Simplicial means of the capital allocation principles 


Xı X2 X3 
AM1 (X1, X2, X3) 57.11% 35.51% 7.38% 
AM1 (X4, X5, X6) 46.64% 50.60% 2.74% 


As it can be proved, the components of the simplicial averages are not equal 
to the arithmetic mean of the components of the original principles. In fact, 
the components of the simplicial average are linked to the geometric mean 
of the components of the original relative principles. As a final result, let us 
average the two principles displayed in Table 8.2. In this case, the geometric 
means of the three components (n = 3) of the two (m = 2) relative capital 
allocations, denoted as G}, G and G}, respectively, are computed. Their 
values are 


Gi = V57.11% -46.64% = 51.61%, 


CAPITAL ALLOCATION PRINCIPLES AS COMPOSITIONAL DATA 135 


G, = V35.51% -50.62% = 42.39% and 
G} = V7.38% - 2.74% = 4.5%. 


In order to obtain the final result, we need to calculate the closure of iù = 
(G, G3, G3). This is 


AMa (AMa (X1, X2, X3), AMa (Xa, X5, X6)) =€ [Ww] . 


This relative principle is 


(52.40%, 43.04%, 4.57%). 


Final remark. Another feasible approach to reach this unique allocation 


would be to calculate the simplicial arithmetic mean of the whole set of rel- 


ative principles shown in (8.10). It has to be noted that the result would 


certainly be different, because this last approach would lack the specific ag- 


gregation hierarchy that we have imposed herein. 


8.3 Exercises 


1. 


In the example presented in Section 8.2.4, calculate the simplicial dis- 
tance between AM, (AMa (x1, Xo, X), AMa (X4, Xs, X6)) and AM, (x1, 
X2, X3, X4, X5, Xe) where X;, i = 1,...,6, are the relative principles shown 
in (8.10). 


. Observe the distances between principles shown in the example pre- 


sented in Section 8.2.4, find a stand-alone allocation which is located be- 
tween Xi and Xo. 


. Assume an allocation principle in a situation of n different sources that 


assigns an allocation equal to 2i/(n(n + 1)) for the risk i, i = 1,...,n. 
Show that this is not the neutral allocation and find the distance to this 
element as a function of n. Calculate the behaviour of this distance as n 
increases. 


. In the same situation as in the previous exercise, propose another allo- 


cation for i, i = 1,..., n, different to the neutral allocation and compare 
it to 2i/(n(n + 1)) for the risk i, i=1,...,n. 


. Assume a situation of n different sources that assigns an allocation equal 


to 2i/(n(n+ 1)) for the risk i, i = 1,..., n, find an average allocation that 
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summarizes the allocations provided by this one and the neutral alloca- 
tion. Find the average allocation now also with the third allocation that 
you proposed in the previous exercise. 


Appendix A 


A.1 Equivalent expression for the GlueVaR distortion 
function 


Details on the definition of the GlueVaR distortion function mo (u) asa 
linear combination of the distortion functions of TVaR at confidence levels 
p and a, and VaR at confidence level q are provided, i.e. an explanation 
of how to obtain expression (3.3) can be found here. Expression (3.1) of the 
distortion function (u) can be rewritten as, 


Kee) =h,-yp(u)-1[0<u<1- | 


ħa- hy, h — hy 
+ og ee A . 
1[1-p<u<l1-a]+yau) (A.1) 


where 1 [x] < u < x2] is an indicator function, so it takes a value of 1 if 
u € [x1, X2) and 0 otherwise. 
Note that 

ygu)-1[0< u< 1- pfp] = ygu) -yp , (A.2) 


11- p<u< 1-a] = ypu) -Yalu (A.3) 


Yalu) -1f1-Bsu<l-aj= 


{= 
Yalu) — Yalu) [RE]. Iroon -wpol (A.4) 


Taking into account expressions (A.2), (A.3) and (A.4), expression (A.1) may 

be rewritten as, 

(ho-hi): 0-8]. 
p-a 


hy, ho = 
K5 a (u) = 


hı ypu) + 


138 RISK QUANTIFICATION AND ALLOCATION METHODS FOR PRACTITIONERS 


(ho-hi): 0-6)  (ha—hy)-(1- B) 
[=m + in 2 E a 2 E P| ypt 
hħ2-hı 
-(1-d)-Ya A. 
Bag he) Yalt (As) 
(hg—hy)-A-f)  he-hy 
= ee ES -(l-a@)|-We(w. 
| hi + p-a p-a ( æ) | -Walu) 
Given that w1 = hı (ha = ħi) (=p) W2 = ham - (1 — q) and w3 = 
p-a p-a 
1 — hg, expression (3.3) follows directly from (A.5). o 


A.2 Bijective relationship between heights and 
weights as parameters for GlueVaR risk measures 
Pairs of GlueVaR heights (hı, h2) and weights (w1, w2) are linearly related 


to each other. The parameter relationships are (h1, hz)! = H- (w1, w2)' and, 
inversely, (w1, w2)! = H`! - (hy, h2)', where H and H~! matrices are H = 


1-6 l-a fB-1 
| l-a Juar- a E , respectively. 
1 1 wie 
-a p-a 


A.3 Relationship between GlueVaR and Tail 
Distortion risk measures 


This section of the appendix is intended to present the proof of Proposition 
4.1. Following the notation introduced along this work, as for any random 


variable X it holds that GlueVaRg y” (X) = f Xduwith u= Kea oP and 


Tg,a(X) = f Xdn with n = ga © P, proving Proposition 4.1 is equivalent to 


proving that Kea = Yq under the proper conditions on w1, w2 and g. 
On one hand, suppose that w2 = 1 — w and that g is given by expression 
(4.2). First of all, let us rewrite g as 


wi: (1-4) 
1-6 
(w +@2:0-1[d-a)!-(-f)<t<1] 


si =( +w? -t-1[0<t< (0-a). a- p)]+ 


Recall that gq is built as g(;4)-10 <u<l-aj+l[l-a<u< 1]. If 
u is less than 1 — £ therefore t = 7% is less than (1 — a)~!- (1 — p); if u is 


APPENDIX A 139 


comprised between 1 — 6 and 1 — a, then t = ;“, satisfies that (1 — a)7!. 
(1— 6) < t <1. Summarizing, 


w w 
i a -u if O<u<1-8 
= w 
Salu) = wı + ? u if l1-Bs<su<l-a (A6) 
1 if l-a<u<l 


which is the definition of distortion function x% ba * as shown in (3.5). 
On the other hand, consider as starting point the aforementioned expres- 


sion (3.5) of K rae As pointed out, gą is always continuous in 1 — œ. Con- 


sequently parameters of Kea ‘2 must be such that e continuity of the 
equivalent gq in 1 — æ. In other words, lim; (1—@) Ke 2 (U) = w + W2 = 
1=limy|(—a) Ke A 2 Cu). This is exactly condition is =< w1. Now, forc- 


ing ga = Kgg » itis straightforward to go backwards from expression (A.6) 
to expression (4.2) to complete the proof. O 


@),W2 
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